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Description 

Field of the Invention 



This invention relates to newly identified polypeptides and polynucleotides encoding such polypeptides, to their 
use in therapy and in identifying compounds which may be agonists, antagonists and /or inhibitors which are potentially 
useful in therapy, and to production of such polypeptides and polynucleotides. 

Background of the Invention 



The drug discovery process is currently undergoing a fundamental revolution as it embraces functional genomics', 
that is, high throughput genome- or gene-based biology. This approach as a means to identify genes and gene products 
as therapeutic targets is rapidly superceding earlier approaches based on 'positional cloning'. A phenotype that is a 
biological function or genetic disease, would be identified and this would then be tracked back to the responsible gene 
15 based on its genetic map position. 

Functional genomics relies heavily on high-throughput DNA sequencing technologies and the various tools of 
bioinformatics to identify gene sequences of potential interest from the many molecular biology databases now avail- 
able. There is a continuing need to identify and characterise further genes and their related polypeptides/broteins, as 
targets for drug discovery. 



Summary of the Invention 

The present invention relates to CSB2, in particular CSB2 polypeptides and CSB2 polynucleotides, recombinant 
materials and methods for their production. In another aspect, the invention relates to methods for using such polypep- 
tides and polynucleotides, including the treatment of cancer, heart hypertrophy, heart failure, ischaemia, arrythmia, 
hypertension, atherosclerosis, restenosis, chronic and acute inflammation, cerebral stroke, brain diseases, rheumatoid 
arthritis, Romano-Ward syndrome, generalised gangliosidis type I, Sotos syndrome and Laurence-Moon-Bardet-Bield 
syndrome, hereinafter referred to as "the Diseases", amongst others. In a further aspect, the invention relates to meth- 
ods for identifying agonists and antagonists/inhibitors using the materials provided by the invention, and treating con- 
ditions associated with CSB2 imbalance with the identified compounds. In a still further aspect, the invention relates 
to diagnostic assays for detecting diseases associated with inappropriate CSB2 activity or levels. 

Description of the Invention 

In a first aspect, the present invention relates to CSB2 polypeptides. Such peptides include isolated polypeptides 
comprising an amino acid sequence which has at least 70% identity, preferably at least 80% identity, more preferably 
at least 90% identity, yet more preferably at least 95% identity, most preferably at least 97-99% identity, to that of SEQ 
ID NO:2 or SEQ ID NO:6 over the entire length of SEQ ID NO:2 or SEQ ID NO:6. Such polypeptides include those 
comprising the amino acid of SEQ ID NO:2 or SEQ ID NO:6. 

Further peptides of the present invention include isolated polypeptides in which the amino acid sequence has at 
least 70% identity, preferably at least 80% identity, more preferably at least 90% identity, yet more preferably at least 
95% identity, most preferably at least 97-99% identity, to the amino acid sequence of SEQ ID NO:2 or SEQ ID NO'6 
over the entire length of SEQ ID NO:2 or SEQ ID NO:6. Such polypeptides include the polypeptide of SEQ ID N0 2 
and SEQ ID NO:6. 

Further peptides of the present invention include isolated polypeptides encoded by a polynucleotide comprisinq 
the sequence contained in SEQ ID NO: 1 or SEQ ID NO:5. 

The polypeptides shown in SEQ ID NO:2 and SEQ ID NO:6 are the products of alternatively spliced primary tran- 
scripts, comprising the cDNA sequences given in SEQ ID NO: 1 and SEQ ID NO:5. 

Polypeptides of the present invention are members of the guanine nucleotide exchange factor (GEF) family of 
polypeptides. They are therefore of interest because GEF proteins regulate Ras superfamily GTPases which are in- 
volved in cytoskeletal organisation, cell growth and proliferation, cell migration, cell adhesion, membrane trafficking 
vesicles transport, mitosis, ADP-ribosylation and phospholipid metabolism . These properties are hereinafter referred 
to as CSB2 activity- or CSB2 polypeptide activity" or "biological activity of CSB2". Also included amongst these activities 
are antigenic and immunogenic activities of said CSB2 polypeptides, in particular the antigenic and immunogenic 
activities of the polypeptide of SEQ ID NO:2 or SEQ ID NO:6. Preferably, a polypeptide of the present invention exhibits 
at least one biological activity of CSB2. 

The polypeptides of the present invention may be in the form of the "mature" protein or may be a part of a larger 
protein such as a fusion protein. It is often advantageous to include an additional amino acid sequence which contains 
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secretory or leader sequences, pro-sequences, sequences which aid in purification such as multiple histidine residues, 
. or an additional sequence for stability during recombinant production. 

The present invention also includes include variants of the aforementioned polypeptides, that is polypeptides that 
vary from the referents by conservative amino acid substitutions, whereby a residue is substituted by another with like 
5 characteristics. Typical such substitutions are among Ala, Val, Leu and lie; among Ser and Thr, among the acidic 
residues Asp and Glu; among Asn and Gin; and among the basic residues Lys and Arg; or aromatic residues Phe and 
Tyr. Particularly preferred are variants in which several, 5-10, 1-5, 1-3, 1-2 or 1 amino acids are substituted, deleted, 
or added in any combination. 

Polypeptides of the present invention can be prepared in any suitable manner Such polypeptides include isolated 
10 naturally occurring polypeptides, recombinantly produced polypeptides, synthetically produced polypeptides, or 
polypeptides produced by a combination of these methods. Means for preparing such polypeptides are well understood 
in the art. 

In a further aspect, the present invention relates to CSB2 polynucleotides. Such polynucleotides include isolated 
polynucleotides comprising a nucleotide sequence encoding a polypeptide which has at least 70% identity, preferably 

15 at least 80% identity, more preferably at least 90% identity, yet more preferably at least 95% identity, to the amino acid 
sequence of SEQ ID NO:2 or SEQ ID NO:6, over the entire length of SEQ ID NO:2 or SEQ ID NO:6. In this regard, 
polypeptides which have at least 97% identity are highly preferred, whilst those with at least 98-99% identity are more 
highly preferred, and those with at least 99% identity are most highly preferred. Such polynucleotides include a poly- 
nucleotide comprising the nucleotide sequence contained in SEQ ID NO:1 or SEQ ID NO:5 encoding the polypeptide 

20 of SEQ ID NO:2 or SEQ ID NO:6 respectively. 

Further polynucleotides of the present invention include isolated polynucleotides comprising a nucleotide sequence 
that has at least 70% identity, preferably at least 80% identity, more preferably at least 90% identity, yet more preferably 
at least 95% identity, to a nucleotide sequence encoding a polypeptide of SEQ ID NO:2 or SEQ ID NO:6, over the 
entire coding region. In this regard, polynucleotides which have at least 97% identity are highly preferred, whilst those 

25 with at least 98-99% identity are more highly preferred, and those with at least 99% identity are most highly preferred. 

Further polynucleotides of the present invention include isolated polynucleotides comprising a nucleotide sequence 
which has at least 70% identity, preferably at least 80% identity, more preferably at least 90% identity, yet more pref- 
erably at least 95% identity, to SEQ ID NO:1 or SEQ ID NO:5 over the entire length of SEQ ID NO: 1 or SEQ ID NO: 
5. In this regard, polynucleotides which have at least 97% identity are highly preferred, whilst those with at least 98-99% 

30 identiy are more highly preferred, and those with at least 99% identity are most highly preferred. Such polynucleotides 
include a polynucleotide comprising the polynucleotide of SEQ ID NO:1 or SEQ ID NO:5 as well as the polynucleotide 
of SEQ ID NO:1 or SEQ ID NO:5. 

The invention also provides polynucleotides which are complementary to all the above described polynucleotides. 
The nucleotide sequence of SEQ ID NO: shows homology with Netl (Chan, AM et al, Oncogene 1 2(6) p1 259-1 266, 

35 1996). The nucleotide sequence of SEQ ID NO:1 is a cDNA sequence and comprises a polypeptide encoding sequence 
(nucleotide 134 to 1639) encoding a polypeptide of 526 amino acids, the polypeptide of SEQ ID NO:2. The nucleotide 
sequence encoding the polypeptide of SEQ ID NO:2 may be identical to the polypeptide encoding sequence contained 
in SEQ ID NO:1 or it may be a sequence other than the one contained in SEQ ID NO:1 , which, as a result of the 
redundancy (degeneracy) of the genetic code, also encodes the polypeptide of SEQ ID NO:2. The polypeptide of. the 

40 SEQ ID NO:2 is structurally related to other proteins of the guanine nucleotide exchange factor (GEF) family, having 
homology and/or structural similarity with Netl (Chan, AM et al, Oncogene 12(6) p1 259-1266, 1996). 

The nucleotide sequence of SEQ ID NO:5 shows homology with Netl (Chan, AM et al, Oncogene 1 2(6) p1 259-1 266, 
1 996). The nucleotide sequence of SEQ ID NO:5 is a cDN A sequence and comprises a polypeptide encoding sequence 
(nucleotide 73 to 1746) encoding a polypeptide of 558 amino acids, the polypeptide of SEQ ID NO:6. The nucleotide 

45 sequence encoding the polypeptide of SEQ ID NO:6 may be identical to the polypeptide encoding sequence contained 
in SEQ ID NO:5 or it may be a sequence other than the one contained in SEQ ID NO:5, which, as a result of the 
redundancy (degeneracy) of the genetic code, also encodes the polypeptide of SEQ ID NO:6. The polypeptide of the 
SEQ ID NO:6 is structurally related to other proteins of the guanine nucleotide exchange factor (GEF) family, having 
homology and/or structural similarity with Netl (Chan, AM et al, Oncogene 12(6) p1 259-1 266, 1996). 

50 Preferred polypeptides and polynucleotides of the present invention are expected to have, inter alia, similar bio- 

logical functions/properties to their homologous polypeptides and polynucleotides. Furthermore, preferred polypeptides 
and polynucleotides of the present invention have at least one CSB2 activity. 

The present invention also relates to partial or other polynucleotide and polypeptide sequences which were first 
identified prior to the determination of the corresponding full length sequences of SEQ ID NO:1 and SEQ ID NO:5 and 

55 their corresponding polypeptide sequences SEQ ID NO:2 and SEQ ID NO:6. 

Accordingly, in a further aspect, the present invention provides for an isolated polynucleotide which: 

. (a) comprises a nucleotide sequence which has at least 70% identity, preferably at least 80% identity, more pref- 
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erably at least 90% identity, yet more preferably at least 95% identity, even more preferably at least 97-99% identity 
to SEQ ID NO:3 over the entire length of SEQ ID NO:3; 

(b) has a nucleotide sequence which has at least 70% identity, preferably at least 80% identity, more preferably 
at least 90% identity, yet more preferably at least 95% identity, even more preferably at least 97-99% identity, to 

5 SEQ ID NO:3 over the entire length of SEQ ID NO:3; 

(c) the polynucleotide of SEQ ID NO:3; or 

(d) a nucleotide sequence encoding a polypeptide which has at least 70% identity, preferably at least 80% identity, 
more preferably at least 90% identity, yet more preferably at least 95% identity, even more preferably at least 
97-99% identity, to the amino acid sequence of SEQ ID NO:4, over the entire length of SEQ ID NO:4; 

w 

as well as the polynucleotide of SEQ ID NO:3. 

The present invention further provides for a polypeptide which: 

(a) comprises an amino acid sequence which has at least 70% identity, preferably at least 80% identity, more 
'5 preferably at least 90% identity, yet more preferably at least 95% identity, most preferably at least 97-99% identity, 

to that of SEQ ID NO:4 over the entire length of SEQ ID NO:4; 

(b) has an amino acid sequence which is at least 70% identity, preferably at least 80% identity, more preferably at 
least 90% identity, yet more preferably at least 95% identity, most preferably at least 97-99% identity, to the amino 
acid sequence of SEQ ID NO:4 over the entire length of SEQ ID NO:4; 

20 ( C ) comprises the amino acid of SEQ ID NO:4; and 

(d) is the polypeptide of SEQ ID NO:4; 

as well as polypeptides encoded by a polynucleotide comprising the sequence contained in SEQ ID NO:3. 

The nucleotide sequence of SEQ ID NO:3 and the peptide sequence encoded thereby are derived from EST 

25 (Expressed Sequence Tag) sequences. It is recognised by those skilled in the art that there will inevitably be some 
nucleotide sequence reading errors in EST sequences (see Adams, M.D. era/, Nature 377 (supp) 3, 1 995). Accordingly, 
the nucleotide sequence of SEQ ID NO:3 and the peptide sequence encoded therefrom are therefore subject to the 
same inherent limitations in sequence accuracy. Furthermore, the peptide sequence encoded by SEQ ID NO:3 com- 
prises a region of identity or close homology and/or close structural similarity (for example a conservative amino acid 

30 difference) with the closest homologous or structurally similar protein. 

Polynucleotides of the present invention may be obtained, using standard cloning and screening techniques, from 
a cDNA library derived from mRNA in cells of human brain, T cell lymphoma, melanoma, foetal dura mater, foetal heart, 
prostate, endometrial tumour, activated T cell, substantia nigra, testes, apoptotic T cell, foetal liver, multiple sclerosis, 
parathyroid tumour, activated monocyte, primary dendritic cells, adipose, resting T cell, lung, helper T cell, pancreas 

35 islet cell, using the expressed sequence tag (EST) analysis (Adams, M.D., et al Science (1991) 252:1651-1656; Ad- 
ams, M.D. era/., Nature, (1992) 355:632-634; Adams, M.D., era/., Nature (1995) 377 Supp:3-174). Polynucleotides 
of the invention can also be obtained from natural sources such as genomic DNA libraries or can be synthesized using 
well known and commercially available techniques. 

When polynucleotides of the present invention are used for the recombinant production of polypeptides of the 

40 present invention, the polynucleotide may include the coding sequence for the mature polypeptide, by itself; or the 
coding sequence for the mature polypeptide in reading frame with other coding sequences, such as those encoding a 
leader or secretory sequence, a pre-, or pro- or prepro- protein sequence, or other fusion peptide portions. For example, 
a marker sequence which facilitates purification of the fused polypeptide can be encoded. In certain preferred embod- 
iments of this aspect of the invention, the marker sequence is a hexa-histidine peptide, as provided in the pQE vector 

45 (Qiagen, Inc.) and described in Gentz era/., Proc Natl Acad Sci USA (1989) 86:821-824, or is an HA tag. The polynu- 
cleotide may also contain non-coding 5' and 3' sequences, such as transcribed, non-translated sequences, splicing 
and polyadenylation signals, ribosome binding sites and sequences that stabilize mRNA. 

Further embodiments of the present invention include polynucleotides encoding polypeptide variants which com- 
prise the amino acid sequence of SEQ ID NO:2 or SEQ ID NO:6 and in which several, for instance from 5 to 10, 1 to 

50 5, 1 to 3, 1 to 2 or 1 , amino acid residues are substituted, deleted or added, in any combination. 

Polynucleotides which are identical or sufficiently identical to a nucleotide sequence contained in SEQ ID NO:1 or 
SEQ ID NO:5, may be used as hybridization probes for cDNA and genomic DNA or as primers for a nucleic acid 
amplification (PCR) reaction, to isolate full-length cDNAs and genomic clones encoding polypeptides of the present 
invention and to isolate cDNA and genomic clones of other genes (including genes encoding homologs and orthologs 

55 from species other than human) that have a high sequence similarity to SEQ ID NO: 1 or SEQ ID NO:5. Typically these 
nucleotide sequences are 70% identical, preferably 80% identical, more preferably 90% identical, most preferably 95% 
identical to that of the referent. The probes or primers will generally comprise at least 15 nucleotides, preferably, at 
least 30 nucleotides and may have at least 50 nucleotides. Particularly preferred probes will have between 30 and 50 
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nucleotides. 

A polynucleotide encoding a polypeptide of the present invention, including homologs and orthologs from species 
other than human, may be obtained by a process which comprises the steps of screening an appropriate library under 
stringent hybridization conditions with a labeled probe having the sequence of SEQ ID NO: 1 , SEQ ID NO:5 or fragments 

s thereof; and isolating full-length cDNA and genomic clones containing said polynucleotide sequence. Such hybridiza- 
tion techniques are well known to the skilled artisan: Preferred stringent hybridization conditions include overnight 
incubation at 42°C in a solution comprising: 50% formamide, 5xSSC (150mM NaCI, 15mM trisodium citrate), 50 mM 
sodium phosphate (pH7.6), 5x Denhardt's solution, 10 % dextran sulfate, and 20 microgram/ml denatured, sheared 
salmon sperm DNA; followed by washing the filters in 0. Ix SSC at about 65°C. Thus the present invention also includes 

10 polynucleotides obtainable by screening an appropriate library under stingent hybridization conditions with a labeled 
probe having the sequence of SEQ ID NO:1 , SEQ ID NO:5 or fragments thereof. 

The skilled artisan will appreciate that, in many cases, an isolated cDNA sequence will be incomplete, in that the 
region coding for the polypeptide is cut short at the 5' end of the cDNA. This is a consequence of reverse transcriptase, 
an enzyme with inherently low 'processivity' (a measure of the ability of the enzyme to remain attached to the template 

15 during the polymerisation reaction), failing to complete a DNA copy of the mRNA template during 1st strand cDNA 
synthesis. 

There are several methods available and well known to those skilled in the art to obtain full-length cDNAs, or 
extend short cDNAs, for example those based on the method of Rapid Amplification of cDNA ends (RACE) (see, for 
example, Frohman etal., PNAS USA 85, 8998-9002, 19B8). Recent modifications of the technique, exemplified by the 

20 Marathon™ technology (Clontech Laboratories Inc.) for example, have significantly simplified the search for longer 
cDNAs. In the Marathon™ technology, cDNAs have been prepared from mRNA extracted from a chosen tissue and 
an 'adaptor' sequence ligated onto each end. Nucleic acid amplification (PCR) is then carried out to amplify the 'missing' 
5' end of the cDN A using a combination of gene specific and adaptor specific oligonucleotide primers. The PCR reaction 
is then repeated using 'nested' primers, that is, primers designed to anneal within the amplified product (typically an 

25 adaptor specific primer that anneals further 3' in the adaptor sequence and a gene specific primer that anneals further 
5' in the known gene sequence). The products of this reaction can then be analysed by DNA sequencing and a full- 
length cDNA constructed either by joining the product directly to the existing cDNA to give a complete sequence, or 
carrying out a separate full-length PCR using the new sequence information for the design of the 5' primer. 

Recombinant polypeptides of the present invention may be prepared by processes well known in the art from 

30 genetically engineered host cells comprising expression systems. Accordingly, in a further aspect, the present invention 
relates to expression systems which comprise a polynucleotide or polynucleotides of the present invention, to host 
cells which are genetically engineered with such expression sytems and to the production of polypeptides of the in- 
vention by recombinant techniques. Cell-free translation systems can also be employed to produce such proteins using 
RNAs derived from the DNA constructs of the present invention. 

35 For recombinant production, host cells can be genetically engineered to incorporate expression systems or portions 

thereof for polynucleotides of the present invention. Introduction of polynucleotides into host cells can be effected by 
methods described in many standard laboratory manuals, such as Davis et al., Basic Methods in Molecular Biology 
(1986) and Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, N.Y. (1989). Preferred such methods include, for instance, calcium phosphate transfection, DEAE- 

40 dextran mediated transfection, transvection, microinjection, cationic lipid-mediated transfection, electroporation, trans- 
duction, scrape loading, ballistic introduction or infection. 

Representative examples of appropriate hosts include bacterial cells, such as Streptococci, Staphylococci, E. coli, 
Streptomyces and Bacillus subtilis cells; fungal cells, such as yeast cells and Aspergillus cells; insect cells such as 
Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS, HeLa, C1 27, 3T3, BHK, HEK 293 and Bowes 

45 melanoma cells; and plant cells. 

A great variety of expression systems can be used, for instance, chromosomal, episomal and virus-derived sys- 
tems, e.g., vectors derived from bacterial plasmids, from bacteriophage, from transposons, from yeast episomes, from 
insertion elements, from yeast chromosomal elements, from viruses such as baculoviruses, papova viruses, such as 
SV40, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived 

50 . from combinations thereof, such as those derived from plasmid and bacteriophage genetic elements, such as cosmids 
and phagemids. The expression systems may contain control regions that regulate as well as engender expression. 
Generally, any system or vector which is able to maintain, propagate or express a polynucleotide to produce a polypep- 
tide in a host may be used. The appropriate nucleotide sequence may be inserted into an expression system by any 
of a variety of well-known and routine techniques, such as, for example, those set forth in Sambrook et al., Molecular 

55 Cloning, A Laboratory Manual (supra). Appropriate secretion signals may be incorporated into the desired polypeptide 
to allow secretion of the translated protein into the lumen of the endoplasmic reticulum, the periplasmic space or the 
extracellular environment. These signals may be endogenous to the polypeptide or they may be heterologous signals. 
If a polypeptide of the present invention is to be expressed for use in screening assays, it is generally preferred 
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that the polypeptide be produced at the surface ot the cell. In this event, the cells may be harvested prior to use in the 
screening assay. If the polypeptide is secreted into the medium, the medium can be recovered in order to recover and 
purify the polypeptide. If produced intracellularly, the cells must first be lysed before the polypeptide is recovered 

Polypeptides of the present invention can be recovered and purified from recombinant cell cultures by well-known 
methods including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatog- 
raphy, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity chromatography, hydrox- 
ylapatite chromatography and lectin chromatography. Most preferably, high performance liquid chromatography is em- 
ployed for purification. Well known techniques for refolding proteins may be employed to regenerate active conformation 
when the polypeptide is denatured during isolation and or purification. 

This invention also relates to the use of polynucleotides of the present invention as diagnostic reagents Detection 
of a mutated form of the gene characterised by the polynucleotide of SEQ ID NO:1 or SEQ ID NO 5 which is associated 
with a dysfunction will provide a diagnostic tool that can add to, or define, a diagnosis of a disease, or susceptibility to 
adisease, which results from under-expression.over-expression or altered expression of the gene. Individuals carryinq 
mutations in the gene may be detected at the DNA level by a variety of techniques. 

Nucleic acids for diagnosis may be obtained from a subject's cells, such as from blood, urine, saliva, tissue biopsy 
or autopsy material. The genomic DNA may be used directly for detection or may be amplified enzymatically by using 
PCR or other amplification techniques prior to analysis. RNA or cDNA may also be used in similar fashion Deletions 
and insertions can be detected by a change in size of the amplified product in comparison to the normal genotype 
Point mutations can be identified by hybridizing amplified DNA to labeled CSB2 nucleotide sequences Perfectly 
matched sequences can be distinguished from mismatched duplexes by RNase digestion or by differences in melting 
temperatures. DNA sequence differences may also be detected by alterations in electrophoretic mobility of DNA frag- 
ments in gels, with or without denaturing agents, or by direct DNA sequencing (ee, e.g.. Myers et al , Science (1 985) 
230:1242). Sequence changes at specific locations may also be revealed by nuclease protection assays, such as 
RNase and S1 protection or the chemical cleavage method (see Cotton et al., Proc Natl Acad Sci USA (1985)85- 
43974401). In another embodiment, an array of oligonucleotides probes comprising CSB2 nucleotide sequence or 
fragments thereof can be constructed to conduct efficient screening of e.g., genetic mutations. Array technology meth- 
ods are well known and have general applicability and can be used to address a variety of questions in molecular 
genetics including gene expression, genetic linkage, and genetic variability (see for example: M Chee et al Science 
Vol 274, pp 610-613 (1996)). 

The diagnostic assays offer a process for diagnosing or determining a susceptibility to the Diseases through de- 
tection of mutation in the CSB2 gene by the methods described. In addition, such diseases may be diagnosed by 
methods comprising determining from a sample derived from a subject an abnormally decreased or increased level of 
polypeptide or mRNA. Decreased or increased expression can be measured at the RNA level using any of the methods 
well known in the art for the quantitation of polynucleotides, such as, for example, nucleic acid amplification, for instance 
PCR, RT-PCR, RNase protection. Northern blotting and other hybridization methods. Assay techniques that can be 
used to determine levels of a protein, such as a polypeptide of the present invention, in a sample derived from a host 
are well-known to those of skill in the art. Such assay methods include radioimmunoassays, competitive-bindinq assays 
Western Blot analysis and ELISA assays. ' 
Thus in another aspect, the present invention relates to a diagonostic kit which comprises: 

(a) a polynucleotide of the present invention, preferably the nucleotide sequence of SEQ ID NO- 1 SEQ ID N05 
or fragments thereof; 

(b) a nucleotide sequence complementary to that of (a); 

(c) a polypeptide of the present invention, preferably the polypeptide of SEQ ID NO:2, SEQ ID NO 6 or fraqments 
thereof; or 8 

(d) an antibody to a polypeptide of the present invention, preferably to the polypeptide of SEQ ID NO:2 or SEQ ID 
NO;6. 



It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise a substantial component. Such a kit will 
be of use in diagnosing a disease or suspectability to a disease, particularly cancer, heart hypertrophy, heart failure 
ischaemia, arrythmia, hypertension, atherosclerosis, restenosis, chronic and acute inflammation, cerebral stroke brain 
d.seases, rheumato.d arthritis, Romano-Ward syndrome, generalised gangliosidis type I, Sotos syndrome and Lau- 
rence-Moon-Bardet-Bield syndrome, among others. 

The nucleotide sequences of the present invention are also valuable for chromosome localisation The sequence 
is specifically targeted to, and can hybridize with, a particular location on an individual human chromosome The map- 
ping of relevant sequences to chromosomes according to the present invention is an important first step in correlating 
those sequences with gene associated disease. Once a sequence has been mapped to a precise chromosomal loca- 
tion, the physical position of the sequence on the chromosome can be correlated with genetic map data Such data 
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are found in, for example, V. McKusick, Mendelian Inheritance in Man (available on-line through Johns Hopkins Uni- 
versity Welch Medical Library). The relationship between genes and diseases that have been mapped to the same 
chromosomal region are then identified through linkage analysis (coinheritance of physically adjacent genes). 

The differences in the cDNA or genomic sequence between affected and unaffected individuals can also be de- 
5 termined. If a mutation is observed in some or all of the affected individuals but not in any normal individuals, then the 
mutation is likely to be the causative agent of the disease. 

The gene of the present invention maps to human chromosome 3p21-p14 between markers D3S 1300 and 
D3S1600. 

The nucleotide sequences of the present invention are also valuable for tissue localisation. Such techniques allow 

10 the determination of expression patterns of the CSB2 polypeptides in tissues by detection of the mRNAs that encode 
them. These techniques include in situ hybridziation techniques and nucleotide amplification techniques, for example 
PCR. Such techniques are well known in the art. Results from these studies provide an indication of the normal functions 
of the polypeptides in the organism. In addition, comparative studies of the normal expression pattern of CSB2 mRNAs 
with that of mRNAs encoded by a mutant CSB2 gene provide valuable insights into the role of mutant CSB2 polypep- 

15 tides, or that of inappropriate expression of normal CSB2 polypeptides, in disease. Such inappropriate expression may 
be of a temporal, spatial or simply quantitative nature. 

The polypeptides of the invention or their fragments or analogs thereof, or cells expressing them, can also be used 
as immunogens to produce antibodies immunospecific for polypeptides of the present invention. The term "immuno- 
specific" means that the antibodies have substantially greater affinity for the polypeptides of the invention than their 

20 affinity for other related polypeptides in the prior art. 

Antibodies generated against polypeptides of the present invention may be obtained by administering the polypep- 
tides or epitope-bearing fragments, analogs or cells to an animal, preferably a non-human animal, using routine pro- 
tocols. For preparation of monoclonal antibodies, any technique which provides antibodies produced by continuous 
cell line cultures can be used. Examples include the hybridoma technique (Kohler, G. and Milstein, C, Nature (1975) 

25 256:495497), the trioma technique, the human B-cell hybridoma technique (Kozbor et ai, Immunology Today (1983) 
4:72) and the EBV-hybridoma technique (Cole er ai, Monoclonal Antibodies and Cancer Therapy, 77-96, Alan R. Liss, 
Inc., 1985). 

Techniques for the production of single chain antibodies, such as those described in U.S. Patent No. 4,946,778, 
can also be adapted to produce single chain antibodies to polypeptides of this invention. Also, transgenic mice, or 
30 other organisms, including other mammals, may be used to express humanized antibodies. 

The. above-described antibodies may be employed to isolate or to identify clones expressing the polypeptide or to 
purify the polypeptides by affulity chromatography. 

Antibodies against polypeptides of the present invention may also be employed to treat the Diseases, amongst 
others. 

55 in a further aspect, the present invention relates to genetically engineered soluble fusion proteins comprising a 

polypeptide of the present invention, or a fragment thereof, and various portions of the constant regions of heavy or 
light chains of immunoglobulins of various subclasses (IgG, IgM, IgA, IgE). Preferred as an immunoglobulin is the 
constant part of the heavy chain of human IgG, particularly lgG1, where fusion takes place at the hinge region. In a 
particular embodiment, the Fc part can be removed simply by incorporation of a cleavage sequence which can be 

40 cleaved with blood clotting factor Xa. Furthermore, this invention relates to processes for the preparation of these 
fusion proteins by genetic engineering, and to the use thereof for drug screening, diagnosis and therapy. A further 
aspect of the invention also relates to polynucleotides encoding such fusion proteins. Examples of fusion protein tech- 
nology can be found in International Patent Application Nos. W094/29458 and W094/22914. 

Another aspect of the invention relates to a method for inducing an immunological response in a mammal which 

45 comprises inoculating the mammal with a polypeptide of the present invention, adequate to produce antibody and/or 
T cell immune response to protect said animal from the Diseases hereinbefore mentioned, amongst others. Yet another 
aspect of the invention relates to a method of inducing immunological response in a mammal which comprises, deliv- 
ering a polypeptide of the present invention via a vector directing expression of the polynucleotide and coding for the 
polypeptide in vivo in order to induce such an immunological response to produce antibody to protect said animal from 

so diseases. 

A further aspect of the invention relates to an immunological/vaccine formulation (composition) which, when intro- 
duced into a mammalian host, induces an immunological response in that mammal to a polypeptide of the present 
invention wherein the composition comprises a polypeptide or polynucleotide of the present invention. The vaccine 
formulation may further comprise a suitable carrier. Since a polypeptide may be broken down in the stomach, it is 
55 preferably administered parenterally (for instance, subcutaneous, intramuscular, intravenous, or intradermal injection). 
Formulations suitable for parenteral administration include aqueous and non-aqueous sterile injection solutions which 
may contain anti-oxidants, buffers, bacteriostats and solutes which render the formulation instonic with the blood of 
the recipient; and aqueous and non-aqueous sterile suspensions which may include suspending agents or thickening 
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agents. The formulations may be presented in unitOose or multi-dose containers, tor example, sealed ampoules and 
vials and may be stored in a treeze-dried condition requiring only the addition ot the sterile liquid carrier immediately 
prior to use. The vaccine formulation may also include adjuvant systems for enhancing the immunogenicity of the 
formulatiorv such as o.l-.n water systems and other systems known in the art. The dosage will depend on the specific 
activity of the vaccine and can be readily determined by routine experimentation 

Polypeptides of the present invention are responsible for one or more biological functions, including one or more 
disease states, in particular the Diseases hereinbefore mentioned. It is therefore desirous to devise screening methods 
o identify compounds which stimulate or which inhibit the function of the polypeptide. Accordingly, in a further aspect 
he present invention provides for a method of screening compounds to identify those which stimulate or which inhibii 
the function of the polypept.de. In general, agonists or antagonists may be employed for therapeutic and prophylactic 
purposes for such Diseases as hereinbefore mentioned. Compounds may be identified from a variety of sources for 
example, cells, ce -free preparations, chemical libraries, and natural product mixtures. Such agonists, antagonists or 
inhibitors so-,dent,f.ed may be natural or modified substrates, ligands, receptors, enzymes, etc., as the case may be 

^m^^sTm^ 181 ° r ,unc ' ional mime " cs ' hereof (see Coligan et a/ " Current Pro,ocols in lmmu " 

The screening method may simply measure the binding of a candidate compound to the polypeptide or to cells 
or membranes bearing the polypeptide, or a fusion protein thereof by means of a label directly or indirectly associated 
with the candidate compound. Alternatively, the screening method may involve competition with a labeled competitor 

nr U S t e ,r 89n ! n9 m6 II 10dS ^ ,6St Wh6,her ,hS Candida,e results in a signal generated by activation 

or inhibition of the polypeptide, using detection systems appropriate to the cells bearing the polypeptide Inhibitors of 
activation are generally assayed in the presence of a known agonist and the effect on activation by the agonist by the 
presence of the candidate compound is observed. Constitutive^ active polypeptides may be employed in screening 
methods for inverse agonists or inhibitors, in the absence of an agonist or inhibitor, by testing whether the candidate 
compound results ,n mhibrtion of activation of the polypeptide. Further, the screening methods may simply comprise 
me steps of mixing a candidate compound with a solution containing a polypeptide of the present invention, to form a 
mixture, measuring CSB2 activity in the mixture, and comparing the CSB2 activity of the mixture to a standard Fusion 
proteins, such as those made from Fc portion and CSB2 polypeptide, as hereinbefore described, can also be used for 
high-throughput screening assays to identify antagonists for the polypeptide of the present invention (see D Bennett 
st ai, J Mol Recognition, 8:52-58 (1995); and K. Johanson era/., J Biol Chem, 270(16) 9459-9471 (1995)) 

The polynucleotides, polypeptides and antibodies to the polypeptide of the present invention may also be used to 
configure screening methods for detecting the effect of added compounds on the production of mRNA and polypeptide 
in cells. For example, an ELISA assay may be constructed for measuring secreted or cell associated levels of polypep- 
tide using i monoclonal and polyclonal antibodies by standard methods known in the an. This can be used to discover 

S?I 2? ^'L 0 ' T hanCe thS P roduction °' Polypeptide (also called antagonist or agonist, respectively) 
from suitably manipulated cells or tissues. 

The polypeptide may be used to identify membrane bound or soluble receptors, if any, through standard receptor 

Shi? ?' qUe ?H kn °TK n , th , e art - ThSSe inC ' Ude ' bUt are n °' Nmi,ed t0 ' N 9 and bindin 9 and crosshnking assays in 
which the polypept.de is labeled with a radioactive isotope (for instance, '*!), chemically modified (for instance, bioti- 
ny lated), or fused to a peptide sequence suitable for detection or purification, and incubated with a source of the putative 
receptor (cells, cell membranes, cell supematants, tissue extracts, bodily fluids). Other methods include biophysical 
techniques such as surface plasmon resonance and spectroscopy. These screening methods may also be used to 
identify agonists and antagonists of the polypeptide which compete with the binding of the polypeptide to its receptors 
if any. Standard methods for conducting such assays are well understood in the art 

whi,hT,T P ! eS 1 P °, ,e . n, !f ' P ° u lyp , 6 P ,ide an,a 9°nists include antibodies or, in some cases, oligonucleotides or proteins 
which are closely related to the Uganda, substrates, receptors, enzymes, etc., as the case may be, of the polypeptide 
ag a fragment of the ligands, substrates, receptors, enzymes, etc.; or small molecules which bind to the potypeptide 
of the present invention but do not elicit a response, so that the activity of the polypeptide is prevented 

Thus, in another aspect, the present invention relates to a screening kit for identifying agonists, antagonists, lig- 

T^ aP T' I* 68 '. 9nZymeS ' etC ,0r P°'yP e P tides ° f th * P^sent invention; or compounds which decrease 
or enhance the production of such polypeptides, which comprises: 

(a) a polypeptide of the present invention; 

(b) a recombinant cell expressing a polypeptide of the present invention; 

(c) a cell membrane expressing a polypeptide of the present invention; or 

(d) antibody to a polypeptide of the present invention; 

which polypeptide is preferably that of SEQ ID NO:2 or SEQ ID NO:6. 

It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise a substantial component. 
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It will be readily appreciated by the skilled artisan that a polypeptide of the present invention may also be used in 
a method for the structure-based design of an agonist, antagonist or inhibitor of the polypeptide, by: 

(a) determining in the first instance the three-dimensional structure of the polypeptide; 
5 (b) deducing the three-dimensional structure for the likely reactive or binding site(s) of an agonist, antagonist or 

inhibitor; 

(c) synthesing candidate compounds that are predicted to bind to or react with the deduced binding or reactive 
site; and 

(d) testing whether the candidate compounds are indeed agonists, antagonists or inhibitors. It will be further ap- 
io predated that this will normally be an iterative process. 

In a further aspect, the present invention provides methods of treating abnormal conditions such as, for instance, 
cancer, heart hypertrophy, heart failure, ischaemia, arrythmia, hypertension, atherosclerosis, restenosis, chronic and 
acute inflammation, cerebral stroke, brain diseases, rheumatoid arthritis, Romano-Ward syndrome, generalised gan- 

1S gliosidis type I, Sotos syndrome and Laurence-Moon -Bardet-Bield syndrome, related to either an excess of, or an 
under-expression of, CSB2 polypeptide activity. 

If the activity of the polypeptide is in excess, several approaches are available. One approach comprises admin- 
istering to a subject in need thereof an inhibitor compound (antagonist) as hereinabove described, optionally in com- 
bination with a pharmaceutical^ acceptable carrier, in an amount effective to inhibit the function of the polypeptide, 

20 such as, for example, by blocking the binding of ligands, substrates, receptors, enzymes, etc., or by inhibiting a second 
signal, and thereby alleviating the abnormal condition. In another approach, soluble forms of the polypeptides still 
capable of binding the ligand, substrate, enzymes, receptors, etc. in competition with endogenous polypeptide may be 
administered. Typical examples of such competitors include fragments of the CSB2 polypeptide. 

In still another approach, expression of the gene encoding endogenous CSB2 polypeptide can be inhibited using 

25 expression blocking techniques. Known such techniques involve the use of antisense sequences, either internally 
generated or externally administered (see, for example, O'Connor, J Neurochem (1991) 56:560 in Oligodeoxynucle- 
otidesas Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL (1988)). Alternatively, oligonucleotides 
which form triple helices ("triplexes") with the gene can be supplied (see, for example, Lee et at., Nucleic Acids Res 
(1979) 6:3073; Cooney era/., Science (1988) 241:456; Dervan etal, Science (1991) 251:1360). These oligomers can 

30 be administered per se or the relevant oligomers can be expressed in vivo. Synthetic antisense or triplex oligonucle- 
otides may comprise modified bases or modified backbones. Examples of the latter include methytphosphonate, phos- 
phorothioate or peptide nucleic acid backbones. Such backbones are incorporated in the antisense or triplex oligonu- 
cleotide in order to provide protection from degradation by nucleases and are well known in the art. Antisense and 
triplex molecules synthesised with these or other modified backbones also form part of the present invention. 

35 in addition, expression of the CSB2 polypeptide may be prevented by using ribozymes specific to the CSB2 mRNA 

sequence. Ribozymes are catalytically active RN As that can be natural or synthetic (see for example Usman, N, et a!., 
Curr. Opin. Struct. Biol (1 996) 6(4), 527-33.) Synthetic ribozymes can be designed to specifically cleave CSB2 mRNAs 
at selected positions thereby preventing translation of the CSB2 mRNAs into functional polypeptide. Ribozymes may 
be synthesised with a natural ribose phosphate backbone and natural bases, as normally found in RNA molecules. 

40 Alternatively the ribosymes may be synthesised with non-natural backbones to provide protection from ribonuclease 
degradation, for example, 2'-0-methyl RNA, and may contain modified bases. 

For treating abnormal conditions related to an under-expression of CSB2 and its activity, several approaches are 
also available. One approach comprises administering to a subject a therapeutically effective amount of a compound 
which activates a polypeptide of the present invention, i.e., an agonist as described above, in combination with a 

45 pharmaceutical^ acceptable carrier, to thereby alleviate the abnormal condition. Alternatively, gene therapy may be 
employed to effect the endogenous production of CSB2 by the relevant cells in the subject. For example, a polynucle- 
otide of the invention may be engineered for expression in a replication defective retroviral vector, as discussed above. 
The retroviral expression construct may then be isolated and introduced into a packaging cell transduced with a ret- 
roviral plasmid vector containing RNA encoding a polypeptide of the present invention such that the packaging cell 

50 now produces infectious viral particles containing the gene of interest. These producer cells may be administered to 
a subject for engineering cells in vivo and expression of the polypeptide in vivo. For an overview of gene therapy, see 
Chapter 20, Gene Therapy and other Molecular Genetic-based Therapeutic Approaches, (and references cited therein) 
in Human Molecular Genetics, T Strachan and A P Read, BIOS Scientific Publishers Ltd (1996). Another approach is 
to administer a therapeutic amount of a polypeptide of the present invention in combination with a suitable pharma- 

55 ceutical carrier. 

In a further aspect, the present invention provides for pharmaceutical compositions comprising a therapeutically 
effective amount of a polypeptide, such as the soluble form of a polypeptide of the present invention, agonist/antagonist 
peptide or small molecule compound, in combination with a pharmaceutical ly acceptable carrier or excipient. Such 
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th3 tk ' . a ? "u 01 " mi,ed ,0 ' S9line ' buffered saline ' dex,rose ' water - 9'ycero). ethanol, and combinations 
thereof. The invent™ further relates to pharmaceutical packs and kits comprising one or more containers filled Zl 
oneormoreo the ingredtentsofthe aforementioned compositions of the invention. Polypeptides ando.her compounds 
of the present invent™ may be employed alone or in conjunction with other compounds such as therapeutic com- 

The composition will be adapted to the route of administration, for instance by a systemic or an oral route Preferred 
ZZt7 t adm,nis | ration include in i ecti0 ". tyPi^lly by intravenous injection. Other injection routes, such as 
subcutaneous, intramuscular, or intraperitoneal, can be used. Alternative means for systemic administration include 
ransmucosal and transdermal administration using penetrants such as bile salts or fusidic acids To heSemente 

ulld fol ? f yPe T 6 H ° r ° ,her COmP ° UndS °' ,he Pr8Sent inV6n,ion can be f « ed - ente* o a S 
anSoZT^T n Zl adm 7 ,s, ; ati0n ^ a ' S0 bS P ° SSib,e - Adm - s '^ion of these compounds may a,so be topSl 
and/or localized, in the form of salves, pastes, gels, and the like 

rnniJnf OH 83 "/ 3 ! 198 l**" deP6ndS °" the Ch ° iCe °' pep1ide 0r 0,ner impounds of the present invention the 
route of adm.mstrat.on the nature of the formulation, the nature of the subject's condition, and the judgment o the 

' SU ' table d0Sa9eS ' h ° WeVer ' are in ,he ran9e of °' 1 " 1 00 ^ ° f s ^ject. Wide vaSn Z the 

needed dosage, however, are to be expected in view of the variety of compounds available and the differing efficfenc es 
of various routes of administration. For example, oral administration woulo be expected to require 
administrate by intravenous injection. Variations in these dosage levels can oe adjusted using sVnoS empS 
routines for optimization, as is well understood in the art. =>ianaaro empirical 

Polypeptides used in treatment can also be generated endogenously in the subject, in treatment modalities often 

9ene h theraP n y ; aS d6SCribed abWe - Thus ' for exam P |e ' cells • Object may be^TS 

SSSSf" T a n 3 T ° f RNA ' ,0 ^ 3 P °^ P6ptide 9X V "°- and tor exam P le ' "V the use of a retroviral 
plasmid vector. The cells are then introduced into the subject 

SPO ,rpn y l UC J, e0tid ? T P °' ypep ' l l de seauences ,orm a valuable information resource with which to identify further 
Z Z uslT , TJ°? y f Th ' S ,S m ° St 6aS ^ ,aCilKated by S,0rin9 ,he seauence in a com P"^ readable medium 
GCG and I ~? 1 « ,0 f arCh 8 SeqUenCe da,3baSe USin9 We " kn0Wn searchi "9 to ols, »"* as those in the 
?S 9 S T ^l" 8968 A*"** ln 8 ,Urthef aSpSCt ' th6 pr6Sen, inven, ion provides for a computer 
readable medium having stored thereon a polynucleotide comprising the sequence of SEQ ID N0 1 or SEQ ID NO 5 
and/or a polypeptide sequence encoded thereby. 

The following definitions are provided to facilitate understanding of certain terms used frequently hereinbefore 
antih Q di«^!f ! a n Us f b f ein includes P o| yc'onal ana monoclonal antibodies, chimeric, single chain, and humanized 
antibodies as well as Fab fragments, including the products of an Fab or other immunoglobulin expression library 

Isolated" means altered "by the hand of man" from the natural state. If an "isolated composSion TsuSance 

ZZSTT . * PreSem ' n 8 liVin9 8nimal iS n0t >iS0la,ed '" but ,he same Polynucleotide or pofypeptide sep 
arated from the coexisting materials of its natural state is "isolated", as the term is employed herein 

rna ^°nZ Cle0t, ^ 9 !, n ^? lly r6ferS t0 any Polynucleotide or polydeoxribonucleotide, which may be unmodified 
Za Lu, ° r d f RNA ° r ° NA - Tolynucleotid ^" delude, without limitation, single- and double-stranded DMA 
DNA that .s a mixture of single- and double-stranded regions, single- and double-stranded RNA and RNA that Vs 

XTol P V ' d0Uble - S,randed ° r a mixture °' sin 9le- and double-stranded regions. In addition, ^nucleoli*- 
nZZZl " edre9IOnSCOmprisin9RNAorDNAorb °thRNAandDNATheterm "polynucleotide 

aZrl^l .M^ 8 ! n '!l?K ne ° r m0fe m0difi6d baSSS and DNAS ° r RNAs with backbones modified for stability or for 
o her reasons. Mod.fied" bases mclude, for example, tritylated bases and unusual bases such as inosine A varied 

SSSSLT Tt ,0 T?" RNA| ,hUS ' "P^'^ucleotide" embraces chemically, enzymatJfy or meS 
^2 f Polynucleotides as typically found in nature, as well as the chemical forms of DNA and RNA 

as oSS^r S9S 3 C " Po 'V"^leotide" also embraces relativefy short polynucleotides, often referred to 

b^ZnJS^ refe ( rs *° a "y P e P' id9 or P r °toin comprising two or more amino acids joined to each other by peptide 

to as oeotTS f ^ ■ \ *' ^ *** ° Polv P e Pt^" «*« to both short chains, common.y referred 

to as peptides, oligopeptides or oligomers, and to longer chains, generally referred to as proteins Polypeptides mav 

ESTE "t ° ,her ,han 20 9ene " encoded amino acid * "Polypeptides" include amino acid sequences m^ 
!1! I ,h P ? e c SSe K S ' !^ 85 post - translational Pressing, or by chemical mediation techniques which 
wen T£L ^ Such modifications are well described in basic texts and in more detailed monographs, as 

backb Q n B ,h! n ° US w SearCh J lter8tUra Modifications ma V occur anywhere in a pofypeptide, including the pep ide 
52E2J^rr° 3C ''ff . 8 ' nS 8nd 8min ° ° f Carb0Xy ' ,erminL 11 Wi " be appreciated that the same type of 
SvSl ^ b9 PreS9m ,0 ,hS ^ ° f Varyin9 de9rees at several si, os in a grven po.ypeptide. Also, a given 
polypeptide may contain many types of modifications. Polypeptides may be branched as a result of ubiquitina ion and 
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they may be cyclic, with or without branching. Cyclic, branched and branched cyclic polypeptides may result from post- 
translation natural processes or may be made by synthetic methods. Modifications include acetylation, acylation, ADP- 
ribosylation, amidation, biotinylation, covalent attachment of flavin, covalent attachment of a heme moiety, covalent 
attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent attachment 

s of phosphotidylinositol, cross-linking, cyclization, disulfide bond formation, demethylation, formation of covalent cross- 
links, formation of cystine, formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI anchor 
formation, hydroxylation, iodination, methylation, myristoylation, oxidation, proteolytic processing, phosphorylation, 
prenylation, racemization, selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins such as 
arginylation, and ubiquitination (see, for instance, Proteins - Structure and Molecular Properties, 2nd Ed., T. E. Creight- 

10 on, W. H. Freeman and Company, New York, 1993; Wold, R, Post-translational Protein Modifications: Perspectives 
and Prospects, pgs. 1-12 in Post-translational Covalent Modification of Proteins, B. C. Johnson, Ed., Academic Press, 
New York, 1 983; Seifter et al., "Analysis for protein modifications and nonprotein cofactors", Meth Enzymol (1 990) 1 82: 
626-646 and Rattan et al., "Protein Synthesis: Post-translational Modifications and Aging", Ann NY Acad Sci (1992) 
663:48-62). 

'5 "Variant" refers to a polynucleotide or polypeptide that differs from a reference polynucleotide or polypeptide, but 

retains essential properties. Atypical variant of a polynucleotide differs in nucleotide sequence from another, reference 
polynucleotide. Changes in the nucleotide sequence of the variant may or may not alter the amino acid sequence of 
a polypeptide encoded. by the reference polynucleotide. Nucleotide changes may result in amino acid substitutions, 
additions, deletions, fusions and truncations in the polypeptide encoded by the reference sequence, as discussed 

20 below. A typical variant of a polypeptide differs in amino acid sequence from another, reference polypeptide. Generally, 
differences are limited so that the sequences of the reference polypeptide and the variant are closely similar overall 
and, in many regions, identical. A variant and reference polypeptide may differ in amino acid sequence by one or more 
substitutions, additions, deletions in any combination. A substituted or inserted amino acid residue may or may not be 
one encoded by the genetic code. A variant of a polynucleotide or polypeptide may be a naturally occurring such as 

25 an allelic variant, or it may be a variant that is not known to occur naturally. Non-naturally occurring variants of poly- 
nucleotides and polypeptides may be made by mutagenesis techniques or by direct synthesis. 

"Identity," as known in the art, is a relationship between two or more polypeptide sequences or two or more poly- 
nucleotide sequences, as determined by comparing the sequences. In the art, "identity" also means the degree of 
sequence relatedness between polypeptide or polynucleotide sequences, as the case may be, as determined by the 

30 match between strings of such sequences. "Identity" and "similarity" can be readily calculated by known methods, 
including but not limited to those described in (Computational Molecular Biology, Lesk, A.M., ed., Oxford University 
Press, New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New 
York, 1993; Computer Analysis of Sequence Data, Pari I, Griffin, A.M., and Griffin, H.G., eds., Humana Press, New 
Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G:, Academic Press, 1987; and Sequence Analysis 

35 Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991; and Carillo, H., and Lipman, D., SI AM 
J. Applied Math., 48. 1073 (1988). Preferred methods to determine identity are designed to give the largest match 
between the sequences tested. Methods to determine identity and similarity are codified in publicly available computer 
programs. Preferred computer program methods to determine identity and similarity between two sequences include, 
but are not limited to, the GCG program package (Devereux, J., et al., Nucleic Acids Research 12(1): 387 (1984)), 

40 BLASTP, BLASTN, and FASTA (Atschul, S.R et al., J. Molec. Biol 215: 403-410 (1990). The BLAST X program is 
publicly available from NCBI and other sources (BLAST Manual, Altschul, S., et al., NCBI NLM NIH Bethesda, MD 
20894; Altschul, S., etal, J. Mol. Biol. 215: 403-410 (1990). The well known Smith Waterman algorithm may also be 
used to determine identity. 

Preferred parameters for polypeptide sequence comparison include the following: 

45 

1) Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 

Comparison matrix: BLOSSUM62 from Hentikoff and Hentikoff, Proc. Natl. Acad. Sci. USA. 89:10915-10919 
(1992) 

so Gap Penalty: 12 

Gap Length Penalty: 4 

A program useful with these parameters is publicly available as the "gap" program from Genetics Computer Group, 
Madison Wl . The aforementioned parameters are the default parameters for peptide comparisons (along with no penalty 
55 for end gaps). 

Preferred parameters for polynucleotide comparison include the following: 

1) Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 
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Comparison matrix: matches = +10, mismatch = 0 
Gap Penalty: 50 
Gap Length Penalty: 3 

Available as: The "gap" program from Genetics Computer Group, Madison Wl. These are the default parameters 
for nucleic acid comparisons. H 

m f 3 eS; * p °!* moleoxide se ^ uence °» ,he P^sent invention may be identical to the reference sequence 

o SEQ I D NO: 1 or SEQ ID NO:5, that is be 100% identical, or it may include up to a certain integer number of nucleotide 
alterations as compared to the reference sequence. Such alterations are selected from the group consisting of at least 
one nucleotide deletion, substitution, including transition and transversion, or insertion, and wherein said alterations 
may occur at the 5' or 3' terminal positions of the reference nucleotide sequence or anywhere between those terminal 
positions, interspersed either individually among the nucleotides in the reference sequence or in one or more contiguous 
groups within the reference sequence. The number of nucleotide alterations is determined by multiplying the total 

' D , NO: 1 ° f SEQ 10 N0:5 bV <he nUmSriCal PerCent ° f the respective £J* identity 
(divided by 100) and subtracting that product from said total number of nucleotides in SEQ ID NO:1 or SEQ ID NO:5, or: 

n x Sx n-< x „*y). 

wherein n n is the number of nucleotide alterations, x„ is the total number of nucleotides in SEQ ID NO' 1 or SEQ ID 
NO:5, and y is, for instance, 0.70 for 70%, 0.80 for 80%, 0.85 for 85%, 0.90 for 90%, 0.95 for 95%,etc ' and wherein 
any non-integer product of x n and y is rounded down to the nearest integer prior to subtracting it from jL Alterations 
of a polynucleotide sequence encoding the polypeptide of SEQ ID NO:2 or SEQ ID NO:6 may create nonsense mis- 
sense or frameshift mutations in this coding sequence and thereby alter the polypeptide encoded by the polynucleotide 
following such alterations. Huiynuweuiiue 

K,r, ~ m l a rZ, a polype P ,ide sconce of the present invention may be identical to the reference sequence of SEQ ID 
NO:2 or SEQ ID NO:6, that is be 100% identical, or it may include uptoacertain integer number of amino acid alterations 
as compared to the reference sequence such that the % identity is less than 100%. Such alterations are selected from 
the group consisting of at least one amino acid deletion, substitution, including conserve and non^onseivative 
substitution, or insertion, and wherein said alterations may occur at the amino- or carboxy-terminal positions of the 
reference polypeptide sequence or anywhere between those terminal positions, interspersed either individually among 
the amino adds in the reference sequence or in one or more contiguous groups within the reference sequence The 

inToin MnT a So'!nt^ n c S K f0r 8 9iVSn % iden,i,y iS determined b V multiplying the total number of amino acids 
in SEQ ID NO.2 or SEQ ID NO:6 by the numerical percent of the respective percent identityfdivided by 100) and then 
subtracting that product from said total number of amino acids in SEQ ID NO:2 or SEQ ID NO:6, or: 

n a*V( x a*y). 

wherein n a is the number of amino acid alterations, x a is the total number of amino acids in SEQ ID N0 2 or SEQ ID 
NO.6, and y is, for instance 0.70 for 70%, 0.80 for 80%, 0.85 for 85% etc., and wherein any non-integer product of x 
and y is rounded down to the nearest integer prior to subtracting it from x.. 

"Hc-molog" is a generic term used in the art to indicate a polynucleotide or polypeptide sequence possessinq a 
high degree of sequence relatedness to a subject sequence. Such relatedness may be quantified by determining the 
degree of identity and/or similarity between the sequences being compared as hereinbefore described 

Falling with.n this generic term are the terms "ortholog", meaning a polynucleotide or polypeptide that is the func- 
tional equivalent of a polynucleotide or polypeptide in another species, and "paralog" meaning a functionally similar 
sequence when considered with.n the same species. Hence in the rat, for example, a member of the family of serotonin 
receptors is a paralog of the other members of the rat serotonin receptor family. 

B v fl Zl Si0 P D P rn e ! n c V! ,er f 10 a . Pr0,8in 6nC0ded by ,W0 ' ° ,,en unrela,ed ' fused 9 enes or fragments thereof. In one 
example, EP-A-0 464 discloses fusion proteins comprising various portions of constant region of immunoglobulin mol- 
ecules together wrth another human protein or part thereof. In many cases, employing an immunoglobulin Fc region 
as a part of a fusion protein is advantageous for use in therapy and diagnosis resulting in, for example, improved 
pharmacokinetic properties [see, e.g., EP-A 0232 262]. On the other hand, for some uses it would be desirable to be 
able to delete the Fc part after the fusion protein has been expressed, detected and purified 

All publications, including but not limited to patents and patent applications, cited in this specification are herein 
incorporated by reference as if each individual publication were specifically and individually indicated to be incorporated 
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by reference herein as though fully set forth. 
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SEQUENCE INFORMATION 
SEQ ID NO:l 

TAGAAGCCGGCTGTCGGTCTCCGTGTCGCCGCCGCCGCCCGGCATCGTGGAGCTGGGGCCCCCTTTTGCCTG 

ggagttttgtagtcgcctagggtcagcggtgacatcccaaagggcaggcccggcagccgccatggtggccaa 

GGAITACCCCTTCTACCTCACGGTCAAGAGAGCGAACTGCAGCCTGGAGCTACCCCCGGCCAGCGGTCCGGC 

CAAGGACGCTGAGGAGCCTAGTAATAAACGGGTCAAACCCCTTTCCCGAGTCACGTCGCTAGCAAACCTCAT 

CCCGCCCGTGAAGGcCACGCCATTAAAGCGCTTCAGTCAAACCCTGCAGCGCTCCATTAGCTTCCGCAGTGA 

GAGCCGCCCTGACATCCTCGCCCCCCGACCCTGGTCCAGAAATGCCGCCCCCTCGAGCACGAAACGGAGAGA 

TAGCAAGCTGTGGAGTGAGACCTTCGATGTGTGCGTCAATCAGATGCTTACATCCAAGGAAATCAAACGTCA 

GGAGGCGATCTTTGAGCTTTCCCAAGGAGAAGAAGACTTGATAGAAGACTTGAAATTAGCAAAAAAGGCCTA 

TCATGACCCCATGCTGAAACTCTCCATAATGACAGAACAAGAGTTGAATCAAATTTTTGGAACACTGGACTC 

TCTAATTCCTCTACATGAAGAGCTCCTTAGTCAGCTTCGAGATGTTAGGAAGCCTGATGGCTCGACTGAACA 

TGTTGGTCCCATCCTCGTGGGCTGGCTCCCTTGCCTCAGCTCCTATGATAGCTACTGCAGCAATCAAGTAGC 

CGCCAAAGCTCTGCTGGACCACAAAAAGCAAGATCACCGAGTCCAGGATTTCCTACAGCGATGTTTAGAATC 

CCCCTTTAGCCGCAAACTAGATCTCTGGAATTTCCTCGATATTCCAAGAAGCCGCCTGGTAAAATACCCTCT 

GCTTCTCCGAGAAATCTTGAGGCACACACCAAATGATAATCCAGATCAGCAGCACTTGGAAGAAGCTATAAA 

TATCATTCAGGGAATTGTGGCAGAAATCAACACCAAGACTGGTGAATCTGAATGCCGCTATTATAAAGAGCG 

GCTTCTTTACTTGGAAGAAGGCCAGAAAGACTCCCTGATCGACAGCTCTCGAGTCTTGTGTTGTCATGGTGA 

ACTGAAGAACAATCGGGGCGTGAAACTGCATGTTTTCCTGTTCCAAGAAGTGCTTGTGATCACTCGAGCCGT 

CACCCACAATGAGCAGCTTTGCTACCAGCTGTACCGTCAGCCAATCCCCGTGAAAGACCTCCTGCTGGAAGA 

CCTCCAGGATGGAGAAGTGAGGCTGGGTGGCTCCCTGCGAGGGGCATTCAGCAACAATGAGAGAATTAAAAA 

CTTCTTCAGAGTCAGTTTCAAAAATGGATCCCAAAGTCAGACCCACTCGCTACAAGCCAATGACACTTTCAA 

CAAACAGCAGTGGCTTAACTGTATTCGTCAAGCCAAAGAAACAGTTTTGTGTGCTGCCGGGCAAGCTGGGGT 

GCTTGACTCCGAGGGATCGTTCCTAAATCCCACCACCGGGAGCAGAGAGCTACAGGGAGAAACAAAACTTGA 

GCAGATGGACCAATCGGACAGTGAGTCAGACTGTAGTATGGACACGAGTGAGGTCAGCCTCGACTGTGAGCG 

CATGGAACAGACAGACTCTTCCTGTGGAAACAGCAGGCACGGTGAAAGTAACGTCTGACAGAAGCATGTGCA 

CTTCGGGAAGCAGGCCTGCATCTTACCTGTACAGTATTTGCATTCCACAGATGGAACGGTTTGGAGAAGCAC 

TTTTTCATACTTTTGTGAAAGTATACATGTTGGCCCAGTCTCTCGTATCTGTACCTTTGTCCCTAGTACTGT 

AACTGCCAATCTGTCTGTGTAAGCTGGAATCTGTGGCAACTATTACCCTGTGTTGTATTTCCCAAGTGTCTG 

GATGGATGGAGAGGTACTCAAACAAGTTACTTTCAGTTGTCCTGCTGGATTTTAAAAAAATAGAAAAAGAAT 

CTCAAAACTACTGTTTTACATAGATTGTTTGAAGAGTCCTTCCTCTTGTGCTTCTGTACCACTTTCCCAGCT 

CTTAGATGTGGTAGCTAAAGGCACGGAATTTAGACGGCCTTGTAAATAGGGCATGAGGAACTCATCTGTGTA 

TTGGGATGGTATTAGAGAGAGAATCAGGAAAGACCAACTCATGAAGTGAACTTGGTTTGATCTTACTCAACT 

AGAAAGCTTGAAAACATCCCTGGGGATTCTGAAGGCTTAATTTTGCAAAGGAGGATGCATTGTCTGAACTTT 

GCAACTTCATCCAGTGCAAGTTTGATGCAAGAATGTATTAGGACATAAAATAGAGGCTGACCTTAAAAGGGC 

CAGGACAGAAGCGGCTGCCAGCTCTGAATCTTTAACTGAAATGCACATGGCACCAGGAGGTGTCTCTCATAG 

TTGGTTGCTAGCCTAAAACATCAGAATAGAACCCAAAGGGCTTAGGAAGGCCTGCCAGGATAACAAGAAGGC 

CCTGTATTCATTGTGTTTCATCTGCCTAGGCCTACTCATTATTTTAGAGAATGAATGAAGCAACAAGGAAGA 

GAGACCATGACTCTATCGATGACACTGTTTATAGAAACACAGGAGAGGAAGAATTTGGAATGAAAAGCACTT 

CGTCAGAACCTTCTGTGGGAGCCATTGAGAGAAAAGCATGGTCCAGTGCCTTCTGAGAAAGGCCAGAGCTTT 

GGGCTTTCCTGCTCTGCTTTTGGGTCGTCAATTTGCCATCTCTGGTTCTGTGCTATAATCAGAATTGTAATT 

ATGTTCTCCAGAGGCCAATTTCATTAACTCTGATTAATTAGAATCAGCTAGCCAGATTAGTAACCTCTTTGT 

CCAGCCTTGATTTACAGTGCAGGGTAAAGTGCAGACCTTAAAAACAGCTAAGTACCTAGAAGAGCTCCCTGC 

AAGTGTAAATATTAAGGATGACCTGTGCAAAATTATACCCACACCAGCACTAGTGGTAATTATTCTAAATTA 
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TTGCCAAAAAGTTTTTTTTAATCTGTCTTTCAAGTTTACAGAAAAGAAAGCAGTAAATGCATTGATGTCATT 
TTATTATGTACATATATCATGTGCATTCAAGCTGTGTGACAAGATATATCAATATAAAAACAAGGTATATAC 
TTTATTATTTTTTGAAAACAAGGATATTGTGATCAATTTTACCCTGTAAAACATATTTCTGTATTTATAGGT 
CTTAAACATGATGAATTTTTTCTATTACAAGTTTATTTAAAACTGCTTTCTCAAGTCGTTATTGATACAGCA 
AGTGAACCTGCTGCAGACAGAAGCAGAGGAAAGCCAAGAACAGCCTTTATTGGTGAAGAAAAGAATGAATGA 
TTCTTTGTAGGCGCCATCAGCCACTTTTAGAAGCCATCAGCCAGTGTGTTGGGAAAAGAGGTTTGTCAAGTG 
TTGGCCTATGGGAAGGTGGTCAATGAATGTTTTGATGAAATGAATGTTTTTGTATAATGGCCTTAAACTTTT 
CTGGAAGTATTTCAAATAAATTACATTATTAAGTCAAAAAAAAAJW 

SEQIDN0:2 

MVAKDYPFYLTVKRANCSLELPPASGPAKDAEEPSNKRVKPLSRVTSLANLIPPVKATPLKRFSQTLQRSIS 
FRSESRPDILAPRPWSRNAAPSSTKRRDSKLWSETFDVCVNQMLTSKEIKRQEAIFELSQGEEDLIEDLKLA 
KKAYHDPMLKLSIMTEQELNQIFGTLDSLIPLHEELLSQLRDVRKPDGSTEHVGPILVGWLPCLSSYDSYCS 
NQVAAKALLDHKKQDHRVQDFLQRCLESPFSRKLDLWNFLDIPRSRLVKYPLLLREILRHTPNDNPDQQHLE 
EAINIIQGIVAEINTKTGESECRYYKERLLYLEEGQKDSLIDSSRVLCCHGELKNNRGVKLHVFLFQEVLVI 
TRAVTHNEQLCYQLYRQPIPVKDLLLEDLQDGEVRLGGSLRGAFSNNERIKNFFRVSFKNGSQSQTHSLQAN 
DTFNKQQWLNCIRQAKETVLCAAGQAGVLDSEGSFLNPTTGSRELQGETKLEQMDQSDSESDCSMDTSEVSL 
DCERMEQTDSSCGNSRHGESNV 

SEQIDN0:3 

ATCTTTGAGCTTTCCCAAGGAGAAGAAGACTTGATAGAAGACTTGAAATTAGCAAAAAAGGCCTATCATGAC 

CCCATGCTGAAACTCTCCATAATGACAGAACAAGAGTTGAATCAAATTTTTGGAACACTGGACTCTCTAATT* 

CCTCTACATGAAGAGCTCCTTAGTCAGCTTCGAGATGTTAGGAAGCCTGATGGCTCGACTGAACATGTTGGT 

CCCATCCTCGTGGGCTGGCTCCCTTGCCTCAGCTCCTATGATAGCTACTGCAGCAATCAAGTAGCCGCCAAA 

GCTCTGCTGGACCACAAAAAGCAAGATCACCGAGTCCAGGATTTCCTACAGCGATGTTTAGAATCCCCCTTT 

AGCCGCAAACTAGATCTCTGGAATTTCCTCGATATTCCAAGAAGCCGCCTGGTAAAATACCCTCTGCTTCTC 

CGAGAAATCTTGAGGCACACACCAAATGATAATCCAGATCAGCAGCACTTGGAAGAAGCTATAAATATCATT 

CAGGGAATTGTGGCAGAAATCAACACCAAGACTGGTGAATCTGAATGCCGCTATTATAAAGAAGGCGGCTTC 

TTTACTTGGAAGAAGGCCAGAAAGACTCCCTGGATCGACAGCTCTCGAGTCTTGTGTTGTCATGGTGAACTG 

AAGAACAATCGGGGCAGAGAAATGCATGTTTTCCTGTTCCAAGAAGTGCTTGTGATCACTCGAGCCGTCACC 

CACAATGAGCAGCTTTGCTACCAGCTGTACCGTCAGCCAATCCCCGTGAAAGACCTCCTGCTGGAAGACCTC 

CAGGATGGAGAAGTGAGGCTGGGTGGCTCCCTGCGAGGGGCATTCAGCAACAATGAGAGAATTAAAAACTTC 

TTCAGAGTCAGTTTCAAAAATGGATCCCAAAGTCAGACCCACTCGCTTACAAGCCAATGGACACTTTTCAAC 

AAACAGCAGTGGCTTAACTGTATTCGTCAAGCCAAAGAAACAGTT 

SEQ ID N0:4 

IFELSQGEEDLIEDLKLAKKAYHDPMLKLSIMTEQELNQIFGTLDSLIPLHEELLSQLRDVRKPDGSTEHVG 
PILVGWLPCLSSYDSYCSNQVAAKALLDHKKQDHRVQDFLQRCLESPFSRKLDLWNFLDIPRSRLVKYPLLL 
REILRHTPNDNPDQQHLEEAINI IQGIVAEINTKTGESECRYYKEGGFFTWKKARKTPWIDSSRVLCCHGEL 
KNNRGREMHVFLFQEVLVITRAVTHNEQLCYQLYRQPIPVKDLLLEDLQDGEVRLGGSLRGAFSNNERIKNF 
FRVSFKNGSQSQTHSLTSQWTLFNKQQWLNCIRQAKETV 
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SEQ ID N0:5 

GTGAGTTTTCCAGCCTGGAGAGGCCGAACCAGCGCGGGGGTCGAGGAGACCTGAGCCTGCCATACTGAGTCT 

ATGGACAGTTCCACAGCAATGAATCAATGCAGCTGTAGAGGAATGGAAGAAAACAAGGAAAGGCCTAAACGG 

CAGCGGCAAAACAACTTTCCCATGTTTCCCTCTCCTAAAGCCTGGAATTTCAGAGGGCGAAAACGGAAACAG 

AGCACCCAAGATGAAGATGCTGTTAGCCTTTGCAGTCTCGACATAAGTGAGCCTAGTAATAAACGGGTCAAA 

CCCCTTTCCCGAGCCACGTCGCTAGCAAACCTCATCCCGCCCGTGAAGGCCACGCCATTAAAGCGCTTCAGT 

CAAACCCTGCAGCGCTCCATTAGCTTCCGCAGTGAGAGCCGCCCTGACATCCTCGCCCCCCGACCCTGGTCC 

AGAAATGCCGCCCCCTCGAGCACGAAACGGAGAGATAGCAAGCTGTGGAGTGAGACCTTCGATGTGTGCGTC 

AATCAGATGCTTACATCCAAGGAAATCAAACGTCAGGAGGCGATCTTTGAGCTTTCCCAAGGAGAAGAAGAC 

TTGATAGAAGACTTGAAATTAGCAAAAAAGGCCTATCATGACCCCATGCTGAAACTCTCCATAATGACAGAA 

CAAGAGTTGAATCAAATTTTTGGAACACTGGACTCTCTAATTCCTCTACATGAAGAGCTCCTTAGTCAGCTT 

CGAGATGTTAGGAAGCCTGATGGCTCGACTGAACATGTTGGTCCCATCCTCGTGGGCTGGCTCCCTTGCCTC 

AGCTCCTATGATAGCTACTGCAGCAATCAAGTAGCCGCCAAAGCTCTGCTGGACCACAAAAAGCAAGATCAC 

CGAGTCCAGGATTTCCTACAGCGATGTTTAGAATCCCCCTTTAGCCGCAAACTAGATCTCTGGAATTTCCTC 

GATATTCCAAGAAGCCGCCTGGTAAAATACCCTCTGCTTCTCCGAGAAATCTTGAGGCACACACCAAATGAT 

AATCCAGATCAGCAGCACTTGGAAGAAGCTATAAATATCATTCAGGGAATTGTGGCAGAAATCAACACCAAG 

ACTGGTGAATCTGAATGCCGCTATTATAAAGAGCGGCTTCTTTACTTGGAAGAAGGCCAGAAAGACTCCCTG 

ATCGACAGCTCTCGAGTCTTGTGTTGTCATGGTGAACTGAAGAACAATCGGGGCGTGAAACTGCATGTTTTC 

CTGTTCCAAGAAGTGCTTGTGATCACTCGAGCCGTCACCCACAATGAGCAGCTTTGCTACCAGCTGTACCGT 

CAGCCAATCCCCGTGAAAGACCTCCTGCTGGAAGACCTCCAGGATGGAGAAGTGAGGCTGGGTGGCTCCCTG 

CGAGGGGCATTCAGCAACAATGAGAGAATTAAAAACTTCTTCAGAGTCAGTTTCAAAAATGGATCCCAAAGT 

CAGACCCACTCGCTACAAGCCAATGACACTTTCAACAAACAGCAGTGGCTTAACTGTATTCGTCAAGCCAAA 

GAAACAGTTTTGTGTGCTGCCGGGCAAGCTGGGGTGCTTGACTCCGAGGGATCGTTCCTAAATCCCACCACC 

GGGAGCAGAGAGCTACAGGGAGAAACAAAACTTGAGCAGATGGACCAATCGGACAGTGAGTCAGACTGTAGT 

ATGGACACGAGTGAGGTCAGCCTCGACTGTGAGCGCATGGAACAGACAGACTCTTCCTGTGGAAACAGCAGG 

CACGGTGAAAGTAACGTCTGACAGAAGCATGTGCACTTCGGGAAGCAGGCCTGCATCTTACCTGTACAGTAT 

TTGCATTCCACAGATGGAACGGTTTGGAGAAGCACTTTTTCATACTTTTGTGAAAGTATACATGTTGGCCCA 

GTCTCTCGTATCTGTACCTTTGTCCCTAGTACTGTAACTGCCAATCTGTCTGTGTAAGCTGGAATCTGTGGC 

AACTATTACCCTGTGTTGTATTTCCCAAGTGTCTGGATGGATGGAGAGGTACTCAAACAAGTTACTTTCAGT 

TGTCCTGCTGGATTTTAAAAAAATAGAAAAAGAATCTCAAAACTACTGTTTTACATAGATTGTTTGAAGAGT 

CCTTCCTCTTGTGCTTCTGTACCACTTTCCCAGCTCTTAGATGTGGTAGCTAAAGGCACGGAATTTAGACGG 

CCTTGTAAATAGGGCATGAGGAACTCATCTGTGTATTGGGATGGTATTAGAGAGAGAATCAGGAAAGACCAA 

CTCATGAAGTGAACTTGGTTTGATCTTACTCAACTAGAAAGCTTGAAAACATCCCTGGGGATTCTGAAGGCT 

TAATTTTGCAAAGGAGGATGCATTGTCTGAACTTTGCAACTTCATCCAGTGCAAGTTTGATGCAAGAATGTA 

TTAGGACATAAAATAGAGGCTGACCTTAAAAGGGCCAGGACAGAAGCGGCTGCCAGCTCTGAATCTTTAACT 

GAAATGCACATGGCACCAGGAGGTGTCTCTCATAGTTGGTTGCTAGCCTAAAACATCAGAATAGAACCCAAA 

GGGCTTAGGAAGGCCTGCCAGGATAACAAGAAGGCCCTGTATTCATTGTGTTTCATCTGCCTAGGCCTACTC 

ATTATTTTAGAGAATGAATGAAGCAACAAGGAAGAGAGACCATGACTCTATCGATGACACTGTTTATAGAAA 

CACAGGAGAGGAAGAATTTGGAATGAAAAGCACTTCGTCAGAACCTTCTGTGGGAGCCATTGAGAGAAAAGC 

ATGGTCCAGTGCCTTCTGAGAAAGGCCAGAGCTTTGGGCTTTCCTGCTCTGCTTTTGGGTCGTCAATTTGCC 

ATCTCTGGTTCTGTGCTATAATCAGAATTGTAATTATGTTCTCCAGAGGCCAATTTCATTAACTCTGATTAA 

TTAGAATCAGCTAGCCAGATTAGTAACCTCTTTGTCCAGCCTTGATTTACAGTGCAGGGTAAAGTGCAGACC 

TTAAAAACAGCTAAGTACCTAGAAGAGCTCCCTGCAAGTGTAAATATTAAGGATGACCTGTGCAAAATTATA 

CCCACACCAGCACTAGTGGTAATTATTCTAAATTATTGCCAAAAAGTTTTTTTTAATCTGTCTTTCAAGTTT 
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ACAGAAAAGAAAGCAGTAAATGCATTGATGTCATTTTATTATGTACATATATCATGTGCATTCAAGCTGTGT 

GACAAGATATATCAATATAAAAACAAGGTATATACTTTATTATTTTTTGAAAACAAGGATATTGTGATCAAT 

TTTACCCTGTAAAACATATTTCTGTATTTATAGGTCTTAAACATGATGAATTTTTTCTATTACAAGTTTATT 

TAAAACTGCTTTCTCAAGTCGTTATTGATACAGCAAGTGAACCTGCTGCAGACAGAAGCAGAGGAAAGCCAA 

GAACAGCCTTTATTGGTGAAGAAAAGAATGAATGATTCTTTGTAGGCGCCATCAGCCACTTTTAGAAGCCAT 

CAGCCAGTGTGTTGGGAAAAGAGGTTTGTCAAGTGTTGGCCTATGGGAAGGTGGTCAATGAATGTTTTGATG 

AAATGAATGTTTTTGTATAATGGCCTTAAACTTTTCTGGAAGTATTTCAAATAAATTACATTATTAAGTCAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAA 

SEQ ID N0:6 

MDSSTAMNQCSCRGMEENKERPKRQRQNNFPMFPSPKAWNFRGRKRKQSTQDEDAVSLCSLDISEPSNKRVK 
PLSRATSLANLIPPVKATPLKRFSQTLQRSISFRSESRPDILAPRPWSRNAAPSSTKRRDSKLWSETFDVCV 
NQMLTSKEIKRQEAIFELSQGEEDLIEDLKLAKKAYHDPMLKLSIMTEQELNQIFGTLDSLIPLHEELLSQL 
RDVRKPDGSTEHVGPILVGWLPCLSSYDSYCSNQVAAKALLDHKKQDHRVQDFLQRCLESPFSRKLDLWNFL 
DIPRSRLVKYPLLLREILRHTPNDNPDQQHLEEAINIIQGIVAEINTKTGESECRYYKERLLYLEEGQKDSL 
IDSSRVLCCHGELKNNRGVKLHVFLFQEVLVITRAVTHNEQLCYQLYRQPIPVKDLLLEDLQDGEVRLGGSL 
RGAFSNNERIKNFFRVSFKNGSQSQTHSLQANDTFNKQQWLNCIRQAKETVLCAAGQAGVLDSEGSFLNPTT 
GSRELQGETKLEQ^4DQSDSESDCSMDTSEVSLDCERMEQTDSSCGNSRHGESNV 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION 



(i) APPLICANT: SmithKline Beecham pic 
(ii) TITLE OF THE INVENTION: Novel Compounds 



(iii) NUMBER OF SEQUENCES : 6 

(lv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SmithKline Beecham 

<8) STREET: New Horizons Court, Great West Road 

(C) CITY: Brentford 

(D) STATE: 

(E) COUNTRY : UK 

(F) 2IP: TW8 9ET 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER : IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3592 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: 

TAGAAGCCGG CTGTCGGTCT CCGTGTCGCC 

CCCTTTTGCC TGGGAGTTTT GTAGTCGCCT 

CCCGGCAGCC GCCATGGTGG CCAAGGATTA 

CTGCAGCCTG GAGCTACCCC CGGCCAGCGG 

TAAACGGGTC AAACCCCTTT CCCGAGTCAC 

GGCCACGCCA TTAAAGCGCT TCAGTCAAAC 

GAGCCGCCCT GACATCCTCG CCCCCCGACC 

GAAACGGAGA GATAGCAAGC TGTGGAGTGA 

TACATCCAAG GAAATCAAAC GTCAGGAGGC 



SEQ ID NO: 1 : 

GCCGCCGCCC GGCATCGTGG AGCTGGGGCC 
AGGGTCAGCG GTGACATCCC AAA GGGCAGG 
CCCCTTCTAC CTCACGGTCA AGAGAGCGAA 
TCCGGCCAAG GACGCTGAGG AGCCTAGTAA 
GTCGCTAGCA AACCTCATCC CGCCCGTGAA 
CCTGCAGCGC TCCATTAGCT TCCGCAGTGA 
CTGGTCCAGA AATGCCGCCC CCTCGAGCAC 
GACCTTCGAT GTGTGCGTCA ATCAGATGCT 
GATCTTTGAG CTTTCCCAAG GAGAAGAAGA 
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CTTGATAGAA 


GACTTGAAAT 


TAGCAAAAAA 


GGCCTATCAT 


GACCCCATGC 


TGAAACTCTC 


600 




CATAATGACA 


GAACAAGAGT 


TGAATCAAAT 


TTTTGGAACA 


CTGGACTCTC 


TAATTCCTCT 


660 




ACATGAAGAG 


CTCCTTAGTC 


AGCTTCGAGA 


TGTTAGGAAG 


CCTGATGGCT 


CGACTGAACA 


720 


5 


TGTTGGTCCC 


ATCCTCGTGG 


GCTGGCTCCC 


TTGCCTCAGC 


TCCTATGATA 


GCTACTGCAG 


780 




CAATCAAGTA 


GCCGCCAAAG 


CTCTGCTGGA 


CCACAAAAAG 


CAAGATCACC 


GAGTCCAGGA 


840 




TTTCCTACAG 


CGATGTTTAG. 


AATCCCCCTT 


TAGCCGCAAA 


CTAGATCTCT 


GGAATTTCCT 


900 




CGATATTCCA 


AGAAGCCGCC 


TGGTAAAATA 


CCCTCTGCTT 


CTCCGAGAAA 


TCTTGAGGCA 


960 


10 


CACACCAAAT 


GATAATCCAG 


ATCAGCAGCA 


CTTGGAAGAA 


GCTATAAATA 


TCATTCAGGG 


1020 


AATTGTGGCA 


GAAATCAACA 


CCAAGACTGG 


TGAATCTGAA 


TGCCGCTATT 


ATAAAGAGCG 


1080 




GCTTCTTTAC 


TTGGAAGAAG 


GCCAGAAAGA 


CTCCCTGATC 


GACAGCTCTC 


GAGTCTTGTG 


1140 




TTGTCATGGT 


GAACTGAAGA 


ACAATCGGGG 


CGTGAAACTG 


CATGTTTTCC 


TGTTCCAAGA 


1200 




AGTGCTTGTG 


ATCACTCGAG 


CCGTCACCCA 


CAATGAGCAG 


CTTTGCTACC 


AGCTGTACCG 


1260 


15 


TCAGCCAATC 


CCCGTGAAAG 


ACCTCCTGCT 


GGAAGACCTC 


CAGGATGGAG 


AAGTGAGGCT 


1320 




GGGTGGCTCC 


CTGCGAGGGG 


CATTCAGCAA 


CAATGAGAGA 


ATTAAAAACT 


TCTTCAGAGT 


1380 




CAGTTTCAAA 


AATGGATCCC 


AAAGTCAGAC 


CCACTCGCTA 


CAAGCCAATG 


ACACTTTCAA 


1440 




CAAACAGCAG 


TGGCTTAACT 


GTATTCGTCA 


AG CC AAAGAA 


ACAGTTTTGT 


GTGCTGCCGG 


1500 


20 


GCAAGCTGGG 


GTGCTTGACT 


CCGAGGGATC 


GTTCCTAAAT 


CCCACCACCG 


GGAGCAGAGA 


1560 




' GCTACAGGGA 


GAAACAAAAC 


TTGAGCAGAT 


GGACCAATCG 


GACAGTGAGT 


CAGACTGTAG 


1620 




TATGGACACG 


AGTGAGGTCA 


GCCTCGACTG 


TGAGCGCATG 


GAACAGACAG 


ACTCTTCCTG 


1680 




TGGAAACAGC 


AGGCACGGTG 


AAAGTAACGT 


CTGACAGAAG 


CATGTGCACT 


TCGGGAAGCA 


1740 




GGCCTGCATC 


TTACCTGTAC 


AGTATTTGCA 


TTCCACAGAT 


GG AACGGTTT 


GGAGAAGCAC 


1800 


25 


TTTTTCATAC 


TTTTGTGAAA 


GTATACATGT 


TGGCCCAGTC 


TCTCGTATCT 


GTACCTTTGT 


1860 




CCCTAGTACT 


GTAACTGCCA 


ATCTGTCTGT 


GTAAGCTGGA 


ATCTGTGGCA 


ACTATTACCC 


1920 




TGTGTTGTAT 


TTCCCAAGTG 


TCTGGATGGA 


TGGAGAGGTA 


CTCAAACAAG 


TTACTTTCAG 


1980 




TTGTCCTGCT 


GGATTTTAAA 


AAAATAGAAA 


AAGAATCTCA 


AAACTACTGT 


TTTACATAGA 


2040 


30 


TTGTTTGAAG 


AGTCCTTCCT 


CTTGTGCTTC 


TGTACCACTT 


TCCCAGCTCT 


TAGATGTGGT 


2100 




AGCTAAAGGC 


ACGGAATTTA 


GACGGCCTTG 


TAAATAGGGC 


ATGAGGAACT 


CATCTGTGTA 


2160 




TTGGGATGGT 


ATTAGAGAGA 


GAATCAGGAA 


AGACCAACTC 


ATGAAGTGAA 


CTTGGTTTGA 


2220 




TCTTACTCAA 


CTAGAAAGCT 


TGAAAACATC 


CCTGGGGATT 


CTGAAGGCTT 


AATTTTGCAA 


2280 


35 


AGGAGGATGC 


ATTGTCTGAA 


CTTTGCAACT 


TCATCCAGTG 


CAAGTTTGAT 


GCAAGAATGT 


2340 


ATTAGGACAT 


AAAATAGAGG 


CTGACCTTAA 


AAGGGCCAGG 


ACAGAAGCGG 


CTGCCAGCTC 


2400 




TGAATCTTTA 


ACTGAAATGC 


ACATGGCACC 


AGGAGGTGTC 


TCTCATAGTT 


GGTTGCTAGC 


2460 




CTAAAACATC 


AGAATAGAAC 


CCAAAGGGCT 


TAGGAAGGCC 


TGCCAGGATA 


ACAAGAAGGC 


2520 




CCTGTATTCA 


TtGTGTTTCA 


TCTGCCTAGG 


CCTACTCATT 


ATTTTAGAGA 


ATGAATGAAG 


2580 


40 


CAACAAGGAA 


GAGAGACCAT 


GACTCTATCG 


ATGACACTGT 


TTATAGAAAC 


ACAGGAGAGG 


2640 




AAGAATTTGG 


AATGAAAAGC 


ACTTCGTCAG 


AACCTTCTGT 


GGGAGCCATT 


GAGAGAAAAG 


2700 




CATGGTCCAG 


TGCCTTCTGA 


GAAAGGCCAG 


AGCTTTGGGC 


TTTCCTGCTC 


TGCTTTTGGG 


2760 




TCGTCAATTT 


GCCATCTCTG 


GTTCTGTGCT 


ATAATCAGAA 


TTGTAATTAT 


GTTCTCCAGA 


2820 




GGCCAATTTC 


ATTAACTCTG 


ATTAATTAGA 


ATCAGCTAGC 


CAGATTAGTA 


ACCTCTTTGT 


2880 


CCAGCCTTGA 


TTTACAGTGC 


AGGGTAAAGT 


GCAGACCTTA 


AAAACAGCTA 


AGTACCTAGA 


2940 




AGAGCTCCCT 


GCAAGTGTAA 


ATATTAAGGA 


TGACCTGTGC 


AAAATTATAC 


CCACACCAGC 


3000 




ACTAGTGGTA 


ATTATTCTAA 


ATTATTGCCA 


AAAAGTTTTT 


TTTAATCTGT 


CTTTCAAGTT 


3060 




TACAGAAAAG 


AAAGCAGTAA 


ATGCATTGAT 


GTCATTTTAT 


TATGTACATA 


TATCATGTGC 


3120 


50 


ATTCAAGCTG 


TGTGACAAGA 


TATATCAATA 


TAAAAACAAG 


GTATATACTT 


TATTATTTTT 


3180 




TGAAAACAAG 


GATATTGTGA 


TCAATTTTAC 


CCTGTAAAAC 


ATATTTCTGT 


ATTTATAGGT 


3240 




CTTAAACATG 


ATGAATTTTT 


TCTATTACAA 


GTTTATTTAA 


AACTGCTTTC 


TCAAGTCGTT 


3300 




ATTGATACAG 


CAAGTGAACC 


TGCTGCAGAC 


AGAAGCAGAG 


GAAAGCCAAG 


AACAGCCTTT 


3360 


55 


ATTGGTGAAG 


AAAAGAATGA 


ATGATTCTTT 


GTAGGCGCCA 


TCAGCCACTT 


TTAGAAGCCA 


3420 


TCAGCCAGTG 


TGTTGGGAAA 


AGAGGTTTGT 


CAAGTGTTGG 


CCTATGGGAA 


GGTGGTCAAT 


3480 
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GAATGTTTTG ATGAAATGAA TGTTTTTGTA TAATGGCCTT AAACTTTTCT GGAAGTATTT 
CAAATAAATT ACATTATTAA GTCAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met val Ala Lys Asp Tyr Pro Phe Tyr Leu Thr Val Lys Arg Ala Asn 

15 10 15 

Cys Ser Leu Glu Leu Pro Pro Ala Ser Gly Pro Ala Lys Asp Ala Glu 

20 25 30 

Glu Pro Ser Asn Lys Arg Val Lys Pro Leu Ser Arg Val Thr Ser Leu 

35 40 45 

Ala Asn Leu lie Pro Pro Val Lys Ala Thr Pro Leu Lys Arg Phe Ser 

50 55 6 o 

Gin Thr Leu Gin Arg Ser lie Ser Phe Arg Ser Glu Ser Arg Pro Asp 
65 70 75 80 

He Leu Ala Pro Arg Pro Trp Ser Arg Asn Ala Ala Pro Ser Ser Thr 

85 go 95 

Lys Arg Arg Asp Ser Lys Leu Trp ser Glu Thr Phe Asp Val Cys Val 

100 105 no 

Asn Gin Met Leu Thr Ser Lys Glu lie Lys Arg Gin Glu Ala lie Phe 

115 120 125 

Glu Leu Ser Gin Gly Glu Glu Asp Leu lie Glu Asp Leu Lys Leu Ala 

130 135 i 4 o 

Lys Lys Ala Tyr His Asp Pro Met Leu Lys Leu Ser lie Met Thr Glu 
145 150 155 160 

Gin Glu Leu Asn Gin lie Phe Gly Thr Leu Asp Ser Leu He Pro Leu 

1^5 170 175 

His Glu Glu Leu Leu Ser Gin Leu Arg Asp Val Arg Lys Pro Asp Gly 

180 185 190 

Ser Thr Glu His Val Gly Pro lie Leu Val Gly Trp Leu Pro Cys Leu 

195 200 205 

Ser Ser Tyr Asp Ser Tyr Cys Ser Asn Gin Val Ala Ala Lys Ala Leu 

210 215 220 

Leu Asp His Lys Lys Gin Asp His Arg Val Gin Asp Phe Leu Gin Arg 
225 23 ° 235 240 

Cys Leu Glu Ser Pro Phe Ser Arg Lys Leu Asp Leu Trp Asn Phe Leu 

245 250 255 

Asp lie Pro Arg Ser Arg Leu Val Lys Tyr Pro Leu Leu Leu Arg Glu 
260 265 270 
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lie Leu Arg His Thr Pro Asn Asp Asn Pro Asp Gin Gin His Leu Glu 

275 280 285 

Glu Ala lie Asn He He Gin Gly He Val Ala Glu He Asn Thr Lys 

290 295 300 

Thr Gly Glu ser Glu Cys Arg Tyr Tyr Lys Glu Arg Leu Leu Tyr Leu 
305 310 315 320 

Glu Glu Gly Gin Lys- Asp Ser Leu lie Asp Ser Ser Arg Val Leu Cys 

325 330 335 

Cys His Gly Glu Leu Lys Asn Asn Arg Gly Val Lys Leu His Val Phe 

340 345 350 

Leu Phe Gin Glu Val Leu Val He Thr Arg Ala Val Thr His Asn Glu 

355 360 365 

Gin Leu Cys Tyr Gin Leu Tyr Arg Gin Pro He Pro Val Lys Asp Leu 

370 375 380 

Leu Leu Glu Asp Leu Gin Asp Gly Glu Val Arg Leu Gly Gly Ser Leu 
385 390 395 400 

Arg Gly Ala Phe Ser Asn Asn Glu Arg He Lys Asn Phe Phe Arg Val 

405 410 415 

Ser Phe Lys Asn Gly Ser Gin Ser Gin Thr His Ser Leu Gin Ala Asn 

420 -425 430 

Asp Thr Phe Asn Lys Gin Gin Trp Leu Asn Cys He Arg Gin Ala Lys 
25 435 440 445 

Glu Thr Val Leu Cys Ala Ala Gly Gin Ala Gly Val Leu Asp Ser Glu 

450 455 460 

Gly Ser Phe Leu Asn Pro Thr Thr Gly Ser Arg Glu Leu Gin Gly Glu 
470 m 475 480 

Thr Lys Leu Glu Gin Met Asp Gin Ser Asp Ser Glu Ser Asp Cys Ser 

485 490 495 

Met Asp Thr Ser Glu Val Ser Leu Asp Cys Glu Arg Met Glu Gin Thr 

500 505 510 

Asp Ser Ser Cys Gly Asn Ser Arg His Gly Glu Ser Asn Val 
515 520 525 



10 



15 



20 



30 



35 



40 



45 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 981 base pairs ■ 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



50 < xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATCTTTGAGC TTTCCCAAGG AGAAGAAGAC TTGATAGAAG ACTTGAAATT AGCAAAAAAG 60 

GCCTATCATG ACCCCATGCT GAAACTCTCC ATAATGACAG AACAAGAGTT GAATCAAATT 120 

TTTGGAACAC TGGACTCTCT AATTCCTCTA CATGAAGAGC TCCTTAGTCA GCTTCGAGAT 180 

GTTAGGAAGC CTGATGGCTC GACTGAACAT GTTGGTCCCA TCCTCGTGGG CTGGCTCCCT 240 
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10 



15 



TGCCTCAGCT CCTATGATAG CTACTGCAGC AATCAAGTAG CCGCCAAAGC TCTGCTGGAC 
CACAAAAAGC AAGATCACCG AGTCCAGGAT TTCCTACAGC GATGTTTAGA ATCCCCCTTT 
AGCCGCAAAC TAGATCTCTG GAATTTCCTC GATATTCCAA GAAGCCGCCT GGTAAAATAC 

t"g C aa G gaa T g c T T C r AGAAAT CTTGAGGCAC ACACCAAATG ******* ™^ c c 

IZ^Zt CTATAAA ™ ""CAGGGA ATTGTGGCAG AAATCAACAC CAAGACTGGT 
GAATCTGAAT GCCGCTATTA TAAAGAAGGC GGCTTCTTTA CTTGGAAGAA GGCCAGAAAG 

GGCAG C A C c! GA TCGACAGCTC TCGAGTCTTG TCTTGTCATG GTGAAC — GAACAATCGG 
GGCAGAGAAA TGCATGTTTT CCTGTTCCAA GAAGTGCTTG TGATCACTCG AGCCGTCACC 
CACAATGAGC AGCTTTGCTA CCAGCTCTAC CGTCAGCCAA TCCCCGTGAA AGACC^G 
CTGGAAGACC TCCAGGATGG AGAAGTGAGG CTGGGTGGCT CCCTGCGAGG GGCATTCAGC 
AACAATGAGA GAATTAAAAA CTTCTTCAGA GTCAGTTTCA AAAATGGATC CCAAAGTCAG 
ACCCACTCGC TTACAAGCCA ATGGACACTT TTCAACAAAC AGCAGTGGCT TAACTGTATT 960 
CGTCAAGCCA AAGAAACAGT T ATT 960 

981 



(2) INFORMATION FOR SEQ ID NO : 4 : 

20 U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
25 < D > TOPOLOGY : linear 

(11) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 4- 

JO 

lie Phe Glu Leu Ser Gin Gly Gl u Glu Asp Leu He Glu Asp Leu Lys 

5 ^ Q 

Leu Ala Lys Lys Ala Tyr His Asp Pro Met Leu Lys Leu Ser lie Met 
35 m . 20 25 • 30 

Thr Glu Gin Glu Leu Asn Gin lie Phe Gly Thr Leu Asp Ser Leu He 

Pro Leu His Glu Glu Leu Leu Ser Gin Leu Arg Asp Val Arg Lys Pro 

^ 5 60 
Asp Gly ser Thr Glu His Val Gly Pro lie Leu val Gly Trp Leu Pro 
70 7S 

80 

85 9 ° 95 

Ala Leu Leu Asp His Lys Lys Gin Asp His Arc, Val Gin Asp Phe Leu 

100 105 110 

Gin Arc Cys Leu Glu Ser Pro Phe Ser Ar g Lys Leu Asp Leu Trp Asn 

120 125 
Phe Leu Asp He Pro Ar g Ser Arg Leu Val Lys Tyr Pro Leu Leu Leu 

Arg Glu lie Leu Arg His Thr Pro Asn Asp Asn III Asp Gin Gin His 

150 155 

I- Glu Glu Ala lie Asn He lie Gin Gly He Val Ala Glu „. Z 

55 165 170 175 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 
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10 



15 



20 



180 185 190 

Phe Thr Trp Lys Lys Ala Arg Lys Thr Pro Trp lie Asp Ser Ser Arg 

195 200 205 

Val Leu Cya Cys His Gly Glu Leu Lys Asn Asn Arg Gly Arg Glu Met 

210 215 220 

His Val Phe Leu Phe Gin Glu Val Leu Val lie Thr Arg Ala Val Thr 
225 230 235 240 

His Asn Glu Gin Leu Cys Tyr Gin Leu Tyr Arg Gin Pro lie Pro Val 

245 250 255 

Lys Asp Leu Leu Leu Glu Asp Leu Gin Asp Gly Glu Val Arg Leu Gly 

260 265 270 

Gly Ser Leu Arg Gly Ala Phe Ser Asn Asn Glu Arg He Lys Asn Phe 

275 280 285 

Phe Arg Val Ser Phe Lys Asn Gly Ser Gin Ser Gin Thr His Ser Leu 

290. 295 300 

Thr Ser Gin Trp Thr Leu Phe Asn Lys Gin Gin Trp Leu Asn Cys He 
305 310 315 320 

Arg Gin Ala Lys Glu Thr Val 
325 



(2} - INFORMATION FOR.SEQ ID NO:5: 

25 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 3627 base pairs 
<B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 





GTGAGTTTTC 


CAGCCTGGAG 


AGGCCGAACC 


AGCGCGGGGG 


TCGAGGAGAC 


CTGAGCCTGC 


60 




CATACTGAGT 


CTATGGACAG 


TTCCACAGCA 


ATGAATCAAT 


GCAGCTGTAG 


AGGAATGGAA 


120 


40 


GAAAACAAGG 


AAAGGCCTAA 


ACGGCAGCGG 


CAAAACAACT 


TTCCCATGTT 


TCCCTCTCCT 


180 




AAAGCCTGGA 


ATTTCAGAGG 


GCGAAAACGG 


AAACAGAGCA 


CCCAAGATGA 


AGATGCTGTT 


240 




AGCCTTTGCA 


GTCTCGACAT 


AAGTGAGCCT 


AGTAATAAAC 


GGGTCAAACC 


CCTTTCCCGA 


300 




GCCACGTCGC 


TAGCAAACCT 


CATCCCGCCC 


GTGAAGGCCA 


CGCCATTAAA 


GCGCTTCAGT 


360 


45 


CAAACCCTGC 


AGCGCTCCAT 


TAGCTTCCGC 


AGTGAGAGCC 


GCCCTGACAT 


CCTCGCCCCC 


420 


CGACCCTGGT 


CCAGAAATGC 


CGCCCCCTCG 


AGCACGAAAC 


GGAGAGATAG 


CAAGCTGTGG 


480 




AGTGAGACCT 


TCGATGTGTG 


CGTCAATCAG 


ATGCTTACAT 


CCAAGGAAAT 


CAAACGTCAG 


540 




GAGGCGATCT 


TTGAGCTTTC 


CCAAGGAGAA 


GAAGACTTGA 


TAGAAGACTT 


GAAATTAGCA 


600 




AAAAAGGCCT 


ATCATCACCC 


CATGCTGAAA 


CTCTCCATAA 


TGACAGAACA 


AGAGTTGAAT 


660 


50 


CAAATTTTTG 


GAACACTGGA 


CTCTCTAATT 


CCTCTACATG 


AAGAGCTCCT 


TAGTCAGCTT 


720 




CGAGATGTTA 


GGAAGCCTGA 


TGGCTCGACT 


GAACATGTTG 


GTCCCATCCT 


CGTGGGCTGG 


780 




CTCCCTTGCC 


TCAGCTCCTA 


TGATAGCTAC 


TGCAGCAATC 


AAGTAGCCGC 


CAAAGCTCTG 


840 




CTGGACCACA 


AAAAGCAAGA 


TCACCGAGTC 


CAGGATTTCC 


TACAGCGATG 


TTTAGAATCC 


900 


55 


CCCTTTAGCC 


GCAAACTAGA 


TCTCTGGAAT 


TTCCTCGATA 


TTCCAAGAAG 


CCGCCTGGTA 


960 


AAATACCCTC 


TGCTTCTCCG 


AGAAATCTTG 


AGGCACACAC 


CAAATGATAA 


TCCAGATCAG 


1020 
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CAGCACTTGG 
ACTGGTGAAT 
AAAGACTCCC 
CGGGGCGTGA 
ACCCACAATG 
CTGCTGGAAG 
AGCAACAATG 
CAGACCCACT 
CGTCAAGCCA 
GGATCGTTCC 
CAGATGGACC 
GACTGTGAGC 
AACGTCTGAC 
TTGCATTCCA 
CATGTTGGCC 
TCTGTGTAAG 
ATGGATGGAG 
AGAAAAAGAA 
GCTTCTGTAC 
CCTTGTAAAT 
AGGAAAGACC 
ACATCCCTGG 
CAACTTCATC 
CTTAAAAGGG 
GCACCAGGAG 
GGGCTTAGGA 
CTAGGCCTAC 
TATCGATGAC 
GTCAGAACCT 
GCCAGAGCTT 
GTGCTATAAT 
TTAGAATCAG 
AAAGTGCAGA 
AAGGATGACC 
TGCCAAAAAG 
TTGATGTCAT 
CAATATAAAA 
TTTACCCTGT 
TACAAGTTTA 
CAGACAGAAG 
TCTTTGTAGG 
TTTGTCAAGT 
TTGTATAATG 
AAAAAAAAAA 



AAGAAGCTAT 
CTGAATGCCG 
TGATCGACAG 
AACTGCATGT 
AGCAGCTTTG 
ACCTCCAGCA 
AGAGAATTAA 
CGCTACAAGC 
AAGAAACAGT 
TAAATCCCAC 
AATCGGACAG 
GCATGGAACA 
AGAAGCATGT 
CAGATGGAAC 
CAGTCTCTCG 
CTGGAATCTC 
AGGTACTCAA 
TCTCAAAACT 
CACTTTCCCA 
AGGGCATGAG 
AACTCATGAA 
GGATTCTGAA 
CAGTGCAAGT 
CCAGGACAGA 
GTGTCTCTCA 
AGGCCTGCCA 
TCATTATTTT 
ACTGTTTATA 
TCTGTGGGAG 
TGGGCTTTCC 
CAGAATTGTA 
CTAGCCAGAT 
CCTTAAAAAC 
TGTGCAAAAT 
TTTTTTTTAA 
TTTATTATGT 
ACAAGGTATA 
AAAACATATT 
TTTAAAACTG 
CAGAGGAAAG 
CGCCATCAGC 
GTTGGCCTAT 
GCCTTAAACT 
AAAAAAAAAA 



AAATATCATT 
CTATTATAAA 
CTCTCGAGTC 
TTTCCTGTTC 
CTACCAGCTG 
TGGAGAAGTG 
AAACTTCTTC 
CAATGACACT 
TTTGTGTGCT 
CACCGGGAGC 
TGAGTCAGAC 
GACAGACTCT 
GCACTTCGGG 
GGTTTGGAGA 
TATCTGTACC 
TGGCAACTAT 
ACAAGTTACT 
ACTGTTTTAC 
GCTCTTAGAT 
GAACTCATCT 
CTGAACTTGG 
GGCTTAATTT 
TTGATGCAAG 
AGCGGCTGCC 
TAGTTGGTTG 
GGATAACAAG 
AGAGAATGAA 
GAAACACAGG 
CCATTGAGAG 
TGCTCTGCTT 
ATTATGTTCT 
TAGTAACCTC 
AGCTAAGTAC 
TATACCCACA 
TCTGTCTTTC 
ACATATATCA 
TACTTTATTA 
TCTGTATTTA 
CTTTCTCAAG 
CCAAGAACAG 
CACTTTTAGA 
GGGAAGGTGG 
TTTCTGGAAG 
AAAAAAA 



CAGGGAATTG 
GAGCGGCTTC 
TTGTGTTGTC 
CAAGAAGTGC 
TACCGTCAGC 
AGGCTGGGTG 
AGAGTCAGTT 
TTCAACAAAC 
GCCGGGCAAG 
AGAGAGCTAC 
TGTAGTATGG 
TCCT GTGGAA 
AAGCAGGCCT 
AGCACTTTTT 
TTTGTCCCTA 
TACCCTGTGT 
TTCAGTTGTC 
ATAGATTGTT 
GTGGTAGCTA 
GTGTATTGGG 
TTTGATCTTA 
TGCAAAGGAG 
AATGTATTAG 
AGCTCTGAAT 
CTAGCCTAAA 
AAGGCCCTGT 
TGAAGCAACA 
AGAGGAAGAA 
AAAAGCATGG 
TTGGGTCGTC 
CCAGAGGCCA 
TTTGTCCAGC 
CTAGAAGAGC 
CCAGCACTAG 
AAGTTTACAG 
TGTGCATTCA 
TTTTTTGAAA 
TAGGTCTTAA 
TCGTTATTGA 
CCTTTATTGG 
AGCCATCAGC 
TCAATGAATG 
TATTTCAAAT 



TGGCAGAAAT 
TTTACTTGGA 
ATGGTGAACT 
TTGTGATCAC 
CAATCCCCGT 
GCTCCCTGCG 
TCAAAAATGG 
AGCAGTGGCT 
CTGGGGTGCT 
AGGGAGAAAC 
ACACGAGTGA 
ACAGCAGGCA 
GCATCTTACC 
CATACTTTTG 
GTACTGTAAC 
TGTATTTCCC 
CTGCTGGATT 
TGAAGAGTCC 
AAGGCACGGA 
ATGGTATTAG 
CTCAACTAGA 
GATGCATTGT 
GACATAAAAT 
CTTTAACTGA 
ACATCAGAAT 
ATTCATTGTG 
AGGAAGAGAG 
TTTGGAATGA 
TCCAGTGCCT 
AATTTGCCAT 
ATTTCATTAA 
CTTGATTTAC 
TCCCTGCAAG 
TGGTAATTAT 
AAAAGAAAGC 
AGCTGTGTGA 
ACAAGGATAT 
ACATGATGAA 
TACAGCAAGT 
TGAAGAAAAG 
CAGTGTGTTG 
TTTTGATGAA 
AAATTACATT 



CAACACCAAG 
AGAAGGCCAG 
GAAGAACAAT 
TCGAGCCGTC 
GAAAGACCTC 
AGGGGCATTC 
ATCCCAAAGT 
TAACTGTATT 
TGACTCCGAG 
AAAACTTGAG 
GGTCAGCCTC 
CGGTGAAAGT 
TGTACAGTAT 
TGAAAGTATA 
TGCCAATCTG 
AAGTGTCTGG 
TTAAAAAAAT 
TTCCTCTTGT 
ATTTAGACGG 
AGAGAGAATC 
AAGCTTGAAA 
CTGAACTTTG 
AGAGGCTGAC 
AATGCACATG 
AGAACCCAAA 
TTTCATCTGC 
ACCATGACTC 
AAAGCACTTC 
TCTGAGAAAG 
CTCTGGTTCT 
CTCTGATTAA 
AGTGCAGGGT 
TGTAAATATT 
TCTAAATTAT 
AGTAAATGCA 
CAAGATATAT 
TGTGATCAAT 
TTTTTTCTAT 
GAACCTGCTG 
AATGAATGAT 
GGAAAAGAGG 
ATGAATGTTT 
ATTAAGTCAA 



1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3627 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 558 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Asp Ser Ser Thr Ala Met Asn Gin Cys Ser Cys Arg Gly Met Glu 

15 10 15 

Glu Asn Lys Glu Arg Pro Lys Arg Gin Arg Gin Asn Asn Phe Pro Met 

. 20 25 30 

Phe Pro Ser Pro Lys Ala Trp Asn Phe Arg Gly Arg Lys Arg Lys Gin 

35 40 45 

Ser Thr Gin Asp Glu Asp Ala Val Ser Leu Cys Ser Leu Asp He Ser 

50 55 60 

Glu Pro Ser Asn Lys Arg Val Lys Pro Leu Ser Arg Ala Thr Ser Leu 
65 70 75 80 

Ala Asn Leu He Pro Pro Val Lys Ala Thr Pro Leu Lys Arg Phe Ser 

85 90 95 

Gin Thr Leu Gin Arg Ser He Ser Phe Arg Ser Glu Ser Arg Pro Asp 

100 105 110 

He Leu Ala Pro Arg Pro Trp Ser Arg Asn Ala Ala Pro Ser Ser Thr 

115 120 4 125 

Lys Arg Arg Asp Ser Lys Leu Trp Ser Glu Thr Phe Asp Val Cys Val 

130 135 140 

Asn Gin Met Leu Thr Ser Lys Glu He Lys Arg Gin Glu Ala He Phe 
145 150 155 160 

Glu Leu Ser Gin Gly Glu Glu Asp Leu He Glu Asp Leu Lys Leu Ala 

165 " 170 175 

Lys Lys Ala Tyr His Asp Pro Met Leu Lys Leu Ser lie Met Thr Glu 

180 185 190 

Gin Glu Leu Asn Gin He Phe Gly Thr Leu Asp Ser Leu He Pro Leu 

195 200 205 

His Glu Glu Leu Leu Ser Gin Leu Arg Asp Val Arg Lys Pro Asp Gly 

210 215 220 

Ser Thr Glu His Val Gly Pro He Leu val Gly Trp Leu Pro Cys Leu 
225 230 235 240 

Ser Ser Tyr Asp Ser Tyr Cys Ser Asn Gin Val Ala Ala Lys Ala Leu 

245 250 255 

Leu Asp His Lys Lys Gin Asp His Arg Val Gin Asp Phe Leu Gin Arg 

260 265 270 

Cy3 Leu Glu Ser Pro Phe Ser Arg Lys Leu Asp Leu Trp Asn Phe Leu 

275 280 285 

Asp He Pro Arg Ser Arg Leu Val Lys Tyr Pro Leu Leu Leu Arg Glu 

290 295 300 

He Leu Arg His Thr Pro Asn Asp Asn Pro Asp Gin Gin His Leu Glu 
305 310 315 320 

Glu Ala He Asn He He Gin Gly He Val Ala Glu He Asn Thr Lys 
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w 



15 



20 



25 



325 330 335 

Thr Gly Glu Ser Glu Cvs Am tv»- , 

u uys Arg Tyr Tyr Lys Glu Arg Leu Leu Tyr Leu 
34 0 o A c 

J45 350 
Glu Glu CUy Gl„ Lya Asp Ser Le(J u- ftsp ^ ^ ^ ^ ^ 

355 3S0 365 

Cys His Gly Glu Leu Lys Asn Asn Arg Gly Val Lys Leu His Val Phe 

375 380 
Leu PhG Gin Glu Val Leu Val lie Thr Arg Ala Val Thr Hig As „ ^ 

390 395 40Q 

Gin Leu Cys Tyr Gin Leu Tyr Arg Gln Pro rle PrQ Val Lys ^ ^ 

Leu Leu Glu A sp Leu Gin Asp Gly Glu ™ Arg Leu Qly Qly ™ ^ 

425 430 
Ar g Gly Ala Phe Ser Asn Asn „. Arg He Lys Asn Phe Phe Ar g Val 

Ser Phe Lys Asn Gly Ser Gln Z G1 „ Thr His ^ ^ ^ ^ ^ 

% 450 455 460 

Asp Thr Phe Asn Lys Gln Gln Trp Leu Asn Cys lie Ar g Gln Ala Lys 

^ 7 ^ 4 75 

Glu Thr Val Leu Cys Ala Ala riw ri« »i ^ 48 ° 
Y ia Ala Gl y Gln A la Gly Val Leu Asp ser Glu 

485 490 495 

Gly Ser Phe Leu Asn Pro Thr Thr Gly Ser Ar g Glu Leu Gln Gly Glu 

505 510 
Thr Lys Leu Glu Gln Met Asp Gln Ser Asp Ser Glu Ser Asp Cys Ser 

520 525 
Met Asp Thr Ser Glu Val Ser Leu Asp Cys Glu Ar g Met Glu Gln Thr 

535 540 
Asp ser Ser Cys Gly Asn Ser Arg His Gly Glu Ser Asn Val 

545 "0 555 



Claims 



1. An isolated polypeptide comprising an amino acid sequence which has at least 70°/ Wo „ tih , . ,„ 
sequence ot SEQ ID NO:2 or SEQ ID NO:6 over the aire length <! Ts^Tf^!^^^^ ^ 

2. An isolated polypeptide as claimed in claim 1 in which the amino acid sequence has a, leas, 9 5 o /o identity. 

3. The polypeptide as claimed in Cairn 1 comprising the amino acid sequence o, SEQ ID NO:2 or SEQ ID NO:6. 

4. The isolated polypeptide of SEQ ID NO:2 or SEQ ID NO:6. 

> w a**™^ i — SS^^SSL eM " 9 '**' n: ° r * maMU " 

7. ^i^^^m^,^,^^^^^^^^ 
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ID NO:1 or SEQ ID NO:5 over the entire length of SEQ ID NO:1 or SEQ ID NO:5 ; or a nucleotide sequence 
complementary to said isolated polynucleotide. 

8. The isolated polynucleotide as claimed in any one of claims 5 to 7 in which the identity is at least 95%. 

5 

9. An isolated polynucleotide selected from: 

(a) a polynucleotide comprising a nucleotide sequence encoding the polypeptide of SEQ ID NO:2; 

(b) the polynucleotide of SEQ ID NO:1 ; and 

(c) a polynucleotide obtainable by screening an appropriate library under stringent hybridization conditions 
with a labeled probe having the sequence of SEQ ID NO:1 or a fragment thereof; or a nucleotide sequence 
complementary to said isolated polynucleotide, 

(d) a polynucleotide comprising a nucleotide sequence encoding the polypeptide of SEQ ID NO:6; 

(e) the polynucleotide of SEQ ID NO:5; and 

(f) a polynucleotide obtainable by screening an appropriate library under stringent hybridization conditions 
with a labeled probe having the sequence of SEQ ID NO:5 or a fragment thereof; or a nucleotide sequence 
complementary to said isolated polynucleotide 

10. An expression system comprising a polynucleotide capable of producing a polypeptide of claim 1 when said ex- 
20 pression system is present in a compatible host cell. 

11. A host cell comprising the expression system of claim 15 or a membrane thereof expressing the polypeptide of 
claim 1 . 

25 12. A process for producing a polypeptide of claim 1 comprising culturing a host cell of claim 11 under conditions 
sufficient for the production of said polypeptide and recovering the polypeptide from the culture medium. 

13. An antibody immunospecific for the polypeptide of claim 1 . 

30 14. A method for screening to identify compounds which stimulate or which inhibit the function of the polypeptide of 
claim 1 which comprises a method selected from the group consisting of: 

(a) measuring the binding of a candidate compound to the polypeptide (or to the cells or membranes bearing 
the polypeptide) or a fusion protein thereof by means of a label directly or indirectly associated with the can- 

35 didate compound; 

(b) measuring the binding of a candidate compound to the polypeptide (or to the cells or membranes bearing 
the polypeptide) or a fusion protein thereof in the presense of a labeled competitior; 

(c) testing whether the candidate compound results in a signal generated by activation or inhbition of the 
polypeptide, using detection systems appropriate to the cells or cell membranes bearing the polypeptide; 

40 (d) mixing a candidate compound with a solution containing a polypeptide of claim 1 , to form a mixture, meas- 

uring activity of the polypeptide in the mixture, and comparing the activity of the mixture to a standard; or 
(e) detecting the effect of a candidate compound on the production of mRNA encoding said polypeptide and 
said polypeptide in cells, using for instance, an ELISA assay. 

45 15. An agonist or antagonist to the polypeptide of claims 1 to 4. 

16. A compound which is: 

(a) an agonist or antagonist to the polypeptide of claims 1 to 4; 
50 (b) isolated polynucleotide of claims 5 to 9; or 

(c) a nucleic acid molecule that modulates the expression of the nucleotide sequence encoding the polypeptide 
of claim 1; 

for use in therapy. 

55 

17. A process for diagnosing a disease or a susceptibility to a disease in a subject related to expression or activity of 
the polypeptide of claim 1 in a subject comprising: 



70 



15 
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(b)analyzingforthe presence or amount of said po^peptide expression in a sample derived from said subject. 

18. An isolated polynucleotide selected form the group consisting of: 

19. A polypeptide selected from the group consisting of: 

ttSXEXSttTSF — « - ™» ■*-» •» * - s E o ,o 

(b) a polypeptide in which the amino acid sequence has at least 70°/ \ri»n«u, t« « • 

SEQ ID NO:4 over the entire length of SEQ ID NO 4 * * 8m,n ° 3C,d S6qUence of 

(c) a polypeptide which comprises the amino acid of SEQ ID N04- 

(d) a polypeptide which is the polypeptide of SEQ ID NO 4 or ' ' 

(e) a polypeptide which is encoded by a polynucleotide comprising the sequence contained in SEQ ,D NO:3. 
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